1 Introduction



The last decades have seen a very successful development of new concepts
and methods for the investigation of low-dimensional dynamical systems. It is
clearly worthwhile to draw inspiration from this low-dimensional experience
when studying the much more difficult and much less explored field of high- and
infinite-dimensional dynamics, especially in the context of spatially extended
systems.

This strategy has motivated the construction of certain models of
high-dimensional dynamical systems which are particularly well suited for an
analysis in the tradition of what has been done for low-dimensional systems. One
class of such models has been termed coupled map lattices [1]. Here, time as well
as the spatial aspect of the system are discretised. The most commonly studied
coupling form in such systems is a nearest-neighbour coupling. There exists
a large number of numerical and phenomenological investigations of such
systems, and they have been very useful for studying typical effects appearing
in extended systems as well as for formulating theoretical concepts appropriate
for the characterisation of such systems.

However, for certain explicit calculations it is attractive to look at a different
form of coupling, which is not local but global, and which gives each subsystem
the same influence on every other subsystem [2]. Mean-field type approximations
of models with local coupling naturally lead to such a global coupling. These
approximations are usually good for long-range interactions [3] or for
nearest-neighbour couplings on high-dimensional lattices [4,5], but all spatial
characteristics of the system lose their meaning, since all subsystems are
treated equally regardless of where they are.

On the other hand, global coupling does not only appear as an approximation; 
there are many applications where one is interested in a global coupling of the 
subsystems right from the beginning. Typically, this is the case in systems for 
which a macroscopic (i.e. global) conserved quantity exists, which mediates the 
global coupling. Prominent examples of globally coupled systems (in the form 
of globally coupled differential equations) are arrays of Josephson junctions [6], 
laser arrays [7], multi-mode lasers [8], certain chemical reactions [9], interacting 
biological clocks [10], and models of neural activity [11]. 

In this paper we study the discrete-time version of globally coupled systems: 
globally coupled maps. 

Before formulating in the next section a more general definition of the type
of systems which we want to consider, we give a typical example: a system of
$N$ globally coupled tent maps for the discrete-time (index $t$) evolution of the
subsystem states,

$$x^{(i)}_{t+1} = f_a\bigl(x^{(i)}_t\bigr) + \frac{\kappa}{N} \sum_{j=1}^{N} f_a\bigl(x^{(j)}_t\bigr), \qquad i = 1, \dots, N, \tag{1}$$



with 



$$f_a(x) = \tfrac{1}{2} - a|x|, \qquad 1 < a \le 2. \tag{2}$$



Here, the parameter $\kappa$ has the meaning of a coupling strength, and $a$
characterises the slope of the tent.
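The example system of globally coupled tent maps can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the random initial condition, and the constant term `c` of the tent map (which merely fixes the scale of the state variable) are choices made here, not part of the model definition.

```python
import random

def tent(x, a, c=0.5):
    """Tent map c - a*|x|; the constant c is an assumed normalisation."""
    return c - a * abs(x)

def step(xs, a, kappa):
    """One time step of the globally coupled system, eq. (1).

    Returns the new configuration and the mean field of the old one.
    """
    fx = [tent(x, a) for x in xs]
    h = sum(fx) / len(xs)              # mean field (1/N) * sum_j f_a(x_j)
    return [f + kappa * h for f in fx], h

def mean_field_series(n, a, kappa, steps, seed=0):
    """Iterate n coupled maps from a random start; return the mean-field series."""
    rng = random.Random(seed)          # hypothetical choice of initial condition
    xs = [rng.uniform(-0.5, 0.5) for _ in range(n)]
    series = []
    for _ in range(steps):
        xs, h = step(xs, a, kappa)
        series.append(h)
    return series

if __name__ == "__main__":
    hs = mean_field_series(n=1000, a=1.65, kappa=0.27, steps=200)
    # Consecutive pairs (h_t, h_{t+1}) form a time-delay plot of the mean field.
    for h_now, h_next in list(zip(hs, hs[1:]))[-5:]:
        print(f"{h_now:+.4f} {h_next:+.4f}")
```

Plotting the pairs $(h_t, h_{t+1})$ for several values of `n` is one way to see how the time-delay picture of the mean field depends on the number of subsystems.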

A very rich variety of long-term dynamical phenomena has been observed in this
system (or in a system of globally coupled logistic maps), caused by the
competition between a possibly chaotic local dynamics $f_a$ and the ordering
influence of the coupling term [2,12-19]. Depending on the values of the
parameters $\kappa$ and $a$, the system can show complete synchronisation of all
subsystems, partial synchronisation with the formation of synchronised clusters,
or turbulent, seemingly uncorrelated movement of all subsystems. In addition,
multi-stability, often with infinitely many attractors, is a common feature.

These phenomena seem to be typical for all globally coupled systems. 

A technical merit of global coupling is that it is relatively easy to analyse
the thermodynamic limit $N \to \infty$ of the time evolution of globally averaged
quantities like the mean field

$$h_t = \frac{1}{N} \sum_{i=1}^{N} f_a\bigl(x^{(i)}_t\bigr). \tag{3}$$

In the thermodynamic limit, the time evolution of the mean field can be
obtained from the so-called nonlinear Frobenius-Perron equation [12,13,15,20,21],
an equation for probability densities $\rho_t(x)$ on the subsystem state space:

$$\rho_{t+1}(y) = \int \delta\bigl(y - f_a(x) - \kappa\, h[\rho_t]\bigr)\, \rho_t(x)\, dx, \tag{4}$$

where

$$h[\rho] := \int f_a(x)\, \rho(x)\, dx. \tag{5}$$



This equation has to be solved self-consistently. Here, the tent map (2) brings a
special advantage: because of the piecewise linearity of the map it is possible to
approximate the densities $\rho_t(x)$ by piecewise constant functions, and this leads
to a very accurate numerical procedure for following the long-term dynamics of
equations (4) and (5), see e.g. [22,23]. While for a truly turbulent, uncorrelated
behaviour of the subsystems one would expect to find a fixed point attractor
for the dynamics of the densities, much more complicated attractors have been
observed, hinting at some nontrivial, 'hidden' correlations in the system even
in cases without any partial synchronisation. One example of such a nontrivial
attractor is shown in Fig. 1, where $h_{t+1}$ is plotted versus $h_t$ for $a = 1.65$
and $\kappa = 0.27$. Judging from this figure, there appears to be a low-dimensional
quasiperiodic attractor of the nonlinear Frobenius-Perron dynamics. (More
detailed calculations have shown that there is a fine structure in the time-delay
mean-field representation which cannot be resolved in Fig. 1.)
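The idea of following the densities with piecewise constant functions can be sketched as follows. This is not the procedure of [22,23], only a simple histogram scheme written under explicit assumptions: a uniform grid on $[-1, 1]$ with 0 as a bin edge (so the tent map is linear on every bin), a tent map with assumed constant term `c`, and re-binning of the image mass after every step. All function names are hypothetical.

```python
def tent(x, a, c=0.5):
    """Tent map c - a*|x|; the constant c is an assumed normalisation."""
    return c - a * abs(x)

def make_grid(n_bins, half_width=1.0):
    """Bin edges on [-half_width, half_width]; n_bins is even so that 0 is an edge."""
    assert n_bins % 2 == 0
    return [-half_width + 2.0 * half_width * i / n_bins for i in range(n_bins + 1)]

def mean_field(mass, edges, a):
    """h[rho] of eq. (5): f_a is linear on each bin, so the midpoint rule is exact."""
    return sum(m * tent(0.5 * (lo + hi), a)
               for m, lo, hi in zip(mass, edges, edges[1:]))

def fp_step(mass, edges, a, kappa):
    """One step of eq. (4) for bin masses; mass mapped outside the grid is lost."""
    h = mean_field(mass, edges, a)
    n = len(mass)
    width = (edges[-1] - edges[0]) / n
    new = [0.0] * n
    for m, lo, hi in zip(mass, edges, edges[1:]):
        if m == 0.0:
            continue
        # A bin with uniform density is mapped linearly onto the interval [ylo, yhi].
        y0 = tent(lo, a) + kappa * h
        y1 = tent(hi, a) + kappa * h
        ylo, yhi = min(y0, y1), max(y0, y1)
        first = max(0, int((ylo - edges[0]) / width) - 1)
        last = min(n - 1, int((yhi - edges[0]) / width) + 1)
        for k in range(first, last + 1):
            overlap = min(yhi, edges[k + 1]) - max(ylo, edges[k])
            if overlap > 0.0:
                new[k] += m * overlap / (yhi - ylo)
    return new, h

edges = make_grid(400)
mids = [0.5 * (lo + hi) for lo, hi in zip(edges, edges[1:])]
mass = [1.0 if abs(x) < 0.5 else 0.0 for x in mids]
total = sum(mass)
mass = [m / total for m in mass]       # uniform density on [-0.5, 0.5]
mass, h0 = fp_step(mass, edges, a=1.65, kappa=0.27)
```

Repeated calls of `fp_step` yield a mean-field sequence that can be compared with finite-$N$ simulations; the re-binning introduces a small numerical diffusion that a more careful scheme would avoid.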

During the last few years the nonlinear Frobenius-Perron equation has been
the subject of numerous numerical and theoretical investigations [24-32].

However, the temporal behaviour of the mean field can be predicted correctly
by the nonlinear Frobenius-Perron dynamics only in the limit $N \to \infty$. For
large but finite $N$ there are clear deviations of the mean field from its limit
behaviour. It is worthwhile to study the $N \to \infty$ asymptotics of the
long-term behaviour of the mean field in more detail, for the following reasons:

a) Numerical simulations of a system of globally coupled maps are bound to
use a finite number $N$ of maps. It is important to be able to estimate how
large one has to choose $N$ in such simulations in order to recognise the details
of the Frobenius-Perron dynamics.

b) Globally coupled systems which are motivated by realistic examples consist
of a finite number of subsystems. If one wants to use for their description the
results obtained in the thermodynamic limit, one has to understand in detail
how the limit situation is approached as the number of subsystems grows.

c) The nonlinear Frobenius-Perron equation may produce features that are
sensitive to the system size in the sense that they cannot be observed for
finite $N$ at all, while other features will still be present. In order to be able to
distinguish between sensitive and insensitive features it is crucial to
understand the $N \to \infty$ asymptotics that leads to the Frobenius-Perron dynamics.

It is easy to get a first idea about the way in which finite $N$ changes the picture
obtained in the thermodynamic limit by comparing Fig. 1 with the results of
some numerical simulations with finite values of $N$. Fig. 2 shows numerical
simulations with a decreasing number of subsystems. One can see clearly that
the details of the attractor of the Frobenius-Perron equation get washed out
more and more, the fewer subsystems are involved.

This picture very much resembles the changes in the appearance of attractors
caused by small external random perturbations; a decreasing number of
subsystems here corresponds to an increasing noise strength there.

The idea of describing deviations from the thermodynamic limit behaviour as
stochastic perturbations is quite obvious, but it is much less easy to specify
concretely which properties these stochastic perturbations must have. Pikovsky
and Kurths [21] gave heuristic arguments based on the central limit theorem
of how to add random perturbations to the Frobenius-Perron dynamics in order
to get a proper description of numerical simulations with finite $N$. While
this method works very well for special examples, it lacks a solid theoretical
justification, so that it is neither clear how widely it is applicable nor how to
improve it.

It is the aim of this article to develop an approach to understanding long-term
deviations from the thermodynamic limit in globally coupled maps based on
large deviations methods. This means that we concentrate on the rare but
very influential events that in the case of very large but finite $N$ render the
behaviour of the coupled system perceptibly different from what the
thermodynamic limit predicts. The central object of this approach is a so-called
quasipotential.

The notion of quasipotentials originated in the physical context from attempts 
to generalise the idea of a thermodynamic potential to nonequilibrium situ- 
ations (see [33] for a review). Its mathematical foundations were formulated 
by Freidlin and Wentzell [34]. Quasipotentials are asymptotic estimates — on 
a logarithmic scale — of invariant probability measures of stochastically per- 
turbed dynamical systems with decreasing noise strength. They have proven 
to be successful for characterising the stability and sensitivity of attractors 
with respect to small noise, for estimating noise-induced escape times and 
transition times between different attractors, and for deriving noise scaling 
laws which describe the influence of small noise on bifurcation scenarios; a 
very short and incomplete list of typical applications may help the reader to 
trace recent developments in this area: [35-40]. 

In constructing quasipotentials for the $N \to \infty$ asymptotics in globally
coupled maps, a rigorous, abstract identification of finite-$N$ effects and random
noise is established, and the above-mentioned successes in applying weak-noise
quasipotentials can hopefully be repeated for large-$N$ quasipotentials.

It should be mentioned that as a starting point for studying large deviations
in globally coupled systems we do not use pure globally coupled deterministic
maps like (1) but globally coupled discrete-time Markov processes (which can
be thought of as stochastic perturbations of maps). This extension is interesting
for applications and advantageous from a technical point of view. It is
interesting to note that the corresponding 'noisy' version of the nonlinear
Frobenius-Perron equation (touched upon in [21,13]) is the discrete-time version of
the nonlinear Fokker-Planck equation introduced by Desai and Zwanzig [41].
A quasipotential approach to finite-$N$ deviations from the nonlinear
Fokker-Planck equation has been derived by Dawson and Gärtner [42] and is closely
related to the present approach. However, the discrete-time theory described
in this paper is conceptually less complicated and easier to use for
concrete calculations.

The plan of this article is as follows: In the next section we give a definition 
of exactly which type of coupled systems we want to investigate. In addition, 
that section serves to introduce our notation. 

Section 3 is the theoretical main part of the article. Subsection 3.1 deals with 
the time-evolution in the thermodynamic limit. In subsection 3.2, deviations 
from the thermodynamic limit in each single time step are studied. Subsection 
3.3 shows how to obtain asymptotic estimates for long-term deviations from 
the thermodynamic limit by putting together the information about the single 
time steps and thereby constructing a quasipotential on a space of probability 
measures. Since this 'full' quasipotential is a rather indigestible object, in 
subsection 3.4 contracted quasipotentials are introduced as a way of extracting 
information about the long-term behaviour of macroscopic variables. 

Section 4 turns to the question of how large-$N$ quasipotentials can be computed.
In particular, in subsection 4.1 a discrete-time Hamiltonian field theory with 
constraints is set up, emerging from a variational principle fulfilled by the 
contracted quasipotentials. While it does not seem to be possible to solve the 
corresponding Hamilton equations in general, one can introduce approxima- 
tions that lead to an algorithm which is suitable for numerical computations of 
contracted quasipotentials close to attractors of the dynamics of macroscopic 
variables. In subsection 4.2 the ideas and the approximation schemes worked 
out in this article are illustrated with a simple example of globally coupled 
linear maps with Gaussian noise. 

Finally, Section 5 sums up the statements of the paper and indicates in which 
directions ongoing and planned investigations are heading. 

An Appendix gives a self-contained proof of Theorem 2 (which is one of the 
main ingredients of the method developed in this article). Its first paragraphs 
can serve as an introduction into the language used for probabilistic arguments 
throughout this paper; more details on the relevant basic facts from probability 
theory can be found in introductory texts like [43,44]. 



2 Systems of globally coupled noisy maps 



The state of a system which consists of $N$ coupled subsystems, each being
defined on some state space $M$, can be characterised by an $N$-tuple

$$x^{[N]} = \bigl(x^{[N](1)}, \dots, x^{[N](N)}\bigr) \in M^N, \tag{6}$$

which we call the configuration of the coupled system. Often it is not necessary
to mention the index $[N]$ explicitly in the components of $x^{[N]}$, so that we
usually write $x^{(i)}$ instead of $x^{[N](i)}$.

In this and the next section we assume that $M$ is a compact metric space.
(However, the assumption of compactness could be weakened at the cost of a
much higher technical effort in most of our considerations.) The most common
case studied in concrete examples is $M = I \subset \mathbb{R}$, a closed interval of the real
line.

Coupling means generally that the subsystems do not follow an individual 
time evolution, but that the state of any subsystem at a later time depends 
on the states of other (and in the case of global coupling even of all) subsystems 
at an earlier time. We are interested in a form of global coupling which can 
be incorporated in a measure theoretic way and which is sometimes called 
democratic coupling since all elements have equal influence in it. 

We first define one of the central objects of our investigation, the empirical
measure connected to a configuration $x^{[N]}$:

$$\varepsilon^{[N]}_{x^{[N]}} := \frac{1}{N} \sum_{i=1}^{N} \delta_{x^{(i)}} \in \mathcal{P}(M). \tag{7}$$

Here, $\mathcal{P}(M)$ denotes the set of Borel probability measures [43] on the
subsystem state space $M$, and $\delta_x$ for $x \in M$ is the Dirac measure concentrated
on $x$, which is defined for all Borel sets $A \subset M$ by $\delta_x(A) = 1$ if $x \in A$ and
$\delta_x(A) = 0$ if $x \notin A$.

The concept of empirical measures has its origin in statistical estimation, but
here it is simply a means of counting how many subsystems are in which state
for a given configuration: if $A \subset M$ and $x^{[N]}$ is the state of the coupled system,
then $\varepsilon^{[N]}_{x^{[N]}}(A)$ gives the fraction of subsystems with subsystem state in $A$. The
density of the empirical measure (which for finite $N$ is defined only in the
distribution sense) has been called 'snapshot distribution' by other authors
[13].
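The counting interpretation of the empirical measure can be made concrete with a short sketch. Representing a set $A$ by a membership predicate is an assumption made purely for this illustration, as are the function and variable names.

```python
def empirical_measure(config):
    """Empirical measure of a configuration, eq. (7), as a function on sets.

    A set A is passed as a predicate in_A(x) -> bool (an assumed
    representation); the result is the fraction of subsystems with state in A.
    """
    n = len(config)

    def measure(in_A):
        return sum(1 for x in config if in_A(x)) / n

    return measure

config = [-0.4, -0.1, 0.0, 0.2, 0.3]           # a configuration with N = 5
eps = empirical_measure(config)
frac = eps(lambda x: 0.0 <= x <= 0.25)         # weight of the interval [0, 0.25]
```

Two of the five subsystem states lie in $[0, 0.25]$, so `frac` is the fraction 2/5.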



Now we define a deterministic temporal evolution of the coupled system with
discrete time $t \in \mathbb{N}_0$ by

$$x^{(i)}_{t+1} = F\bigl(x^{(i)}_t, \varepsilon^{[N]}_{x^{[N]}_t}\bigr), \tag{8}$$

with a continuous function $F : M \times \mathcal{P}(M) \to M$. Note that the subsystem
states $x^{(i)}_t$ carry two indices now, the upper one for numbering the subsystem
and the lower one for denoting the time.



When talking about continuity, we have to specify the topologies which we
use. On the state space $M$ this is of course the topology derived from the
metric on $M$. On the space of probability measures $\mathcal{P}(M)$ we use (unless
mentioned otherwise) the topology of weak convergence; a sequence of
probability measures $(\mu_n)$ is said to converge weakly to $\mu$ if for all bounded
continuous functions $g : M \to \mathbb{R}$:

$$\int_M g(x)\, \mu_n(dx) \xrightarrow[n \to \infty]{} \int_M g(x)\, \mu(dx); \tag{9}$$

we then write $\mu_n \xrightarrow[n \to \infty]{w} \mu$.

Later we will implicitly make use of the fact that there is a metric on $\mathcal{P}(M)$
(e.g. the so-called Lévy-Prohorov metric [43]) which is compatible with the
topology of weak convergence, and that compactness of $M$ implies compactness
of $\mathcal{P}(M)$.

It is useful to introduce a real nonnegative parameter $\kappa$ as coupling strength:
we study continuous families $(F_\kappa : M \times \mathcal{P}(M) \to M)$ which have the property
that $F_0$ does not depend on its second argument:

$$F_0(x, \mu) = f(x), \tag{10}$$

where $f : M \to M$ is a continuous function. This means that for $\kappa = 0$ all
subsystems evolve independently of each other according to the map $f$, whereas
for $\kappa > 0$ a global coupling term is superimposed on the local dynamics. In this
sense the time evolution (8) can be regarded as a system of globally coupled maps.

The concept of empirical measures leads to a definition of global or macroscopic
variables appropriate for globally coupled systems: a macroscopic variable is
an observable which depends on the configuration of the coupled system only
through its empirical measure. Important examples for $M \subset \mathbb{R}$ are the mean
value

$$m^{[N]}\bigl(x^{[N]}\bigr) := m\bigl(\varepsilon^{[N]}_{x^{[N]}}\bigr) \tag{11}$$

with

$$m(\mu) := \int_M x\, \mu(dx) \tag{12}$$

and the mean field

$$h^{[N]}\bigl(x^{[N]}\bigr) := h\bigl(\varepsilon^{[N]}_{x^{[N]}}\bigr) \tag{13}$$

with

$$h(\mu) := \int_M f(x)\, \mu(dx) \tag{14}$$

of a configuration.
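Since an integral against an empirical measure is just an average over the subsystems, both macroscopic variables reduce to one helper function. The following sketch uses hypothetical names and an arbitrarily chosen local map for the usage example.

```python
def integrate(g, config):
    """Integral of g against the empirical measure of config: (1/N) * sum_i g(x_i)."""
    return sum(g(x) for x in config) / len(config)

def mean_value(config):
    """Mean value of a configuration, eqs. (11) and (12)."""
    return integrate(lambda x: x, config)

def mean_field(config, f):
    """Mean field of a configuration for the local map f, eqs. (13) and (14)."""
    return integrate(f, config)

config = [0.1, -0.3, 0.5]
f = lambda x: 1.0 - 2.0 * abs(x)       # an assumed local map, for illustration only
m = mean_value(config)
h = mean_field(config, f)
```

Both functions give the same result for any reordering of `config` (up to rounding), which is the defining property of a macroscopic variable.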

The abstract form (8) of the time evolution contains the various concrete 
models of globally coupled maps which have been studied in the literature: 

Our introductory example (1) would correspond to

$$F_\kappa(x, \mu) = f_a(x) + \kappa\, h(\mu). \tag{15}$$

Here and in the other models the local map $f_a$ depends on some parameter $a$
which usually changes the strength of the nonlinear character of $f_a$.

However, the choice

$$F_\kappa(x, \mu) = (1 - \kappa)\, f_a(x) + \kappa\, h(\mu) \tag{16}$$

used by Kaneko [2] has the advantage that there is a built-in guarantee that
the right hand side is in $M$ as long as $\kappa$ is in the interval $[0, 1]$.

The coupling can act through the parameter $a$, too, as in

$$F_\kappa(x, \mu) = f_{a_0 + \kappa h(\mu)}(x), \tag{17}$$

which has been studied by Pikovsky and Kurths [21].
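The difference between the additive coupling (15) and the convex-combination coupling (16) can be seen in a small sketch. The measure argument is represented by a list of sample points (i.e. an empirical measure), and the local map is an assumed tent map on $[-1, 1]$; both are choices made for illustration, and the function names are hypothetical.

```python
def mean_field(config, f):
    """h(mu) of eq. (14) for the empirical measure of config."""
    return sum(f(x) for x in config) / len(config)

def F_additive(x, config, f, kappa):
    """Coupling of eq. (15): f(x) + kappa * h(mu)."""
    return f(x) + kappa * mean_field(config, f)

def F_convex(x, config, f, kappa):
    """Coupling of eq. (16): (1 - kappa) * f(x) + kappa * h(mu)."""
    return (1.0 - kappa) * f(x) + kappa * mean_field(config, f)

f = lambda x: 1.0 - 2.0 * abs(x)       # assumed local map with f([-1, 1]) = [-1, 1]
config = [0.0, 0.5, 0.25]              # here h(mu) = 0.5

y_add = F_additive(0.0, config, f, kappa=0.2)
y_cvx = F_convex(0.0, config, f, kappa=0.2)
```

For this configuration the additive form maps the state 0 to a value above 1, outside the range of the local map, whereas the convex form stays inside $[-1, 1]$, as the built-in guarantee of (16) requires.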

As already explained in the introduction, in this article we study coupled
systems with a stochastic time evolution, for which the deterministic behaviour
of eq. (8) is a limit case. What we call a system of globally coupled noisy maps
is a time-discrete Markov process $(X^{[N]}_t)$ on $M^N$ which is defined by its
one-step transition probabilities, which have the following form:

$$\mathrm{Prob}\bigl\{X^{[N]}_{t+1} \in A^{(1)} \times \dots \times A^{(N)} \bigm| X^{[N]}_t = x^{[N]}\bigr\} = \prod_{i=1}^{N} Q_{F(x^{(i)},\, \varepsilon^{[N]}_{x^{[N]}})}\bigl(A^{(i)}\bigr) \tag{18}$$

with $A^{(1)}, \dots, A^{(N)} \subset M$, where $(Q_x)$ is a family of probability measures in
$\mathcal{P}(M)$, continuously parametrised by $x \in M$.

One can think of the time evolution (18) as first a deterministic step according
to (8) and then, independently for every subsystem, a stochastic perturbation
distributed as described by the probability measures $Q_x$.

A concrete example in extension of (15) would be

$$x^{(i)}_{t+1} = f\bigl(x^{(i)}_t\bigr) + \frac{\kappa}{N} \sum_{j=1}^{N} f\bigl(x^{(j)}_t\bigr) + \xi^{(i)}_t, \tag{19}$$

where the $\xi^{(i)}_t$ are independent random variables. If $p_Y(\xi)$ is the probability
density of the random variable $\xi^{(i)}_t$, which generally depends on the
unperturbed state $Y := f\bigl(x^{(i)}_t\bigr) + \frac{\kappa}{N} \sum_{j=1}^{N} f\bigl(x^{(j)}_t\bigr)$, then

$$Q_Y(A) = \int_{\{\xi \,:\, Y + \xi \in A\}} p_Y(\xi)\, d\xi. \tag{20}$$

With the choice $Q_x = \delta_x$ the stochastic time evolution (18) reduces to the
deterministic one (8).

In order to interpret (18) as a small stochastic perturbation of (8), a
nonnegative parameter $\eta$ can be introduced as a noise strength in families $(Q^{[\eta]}_x)$
with $Q^{[\eta]}_x \xrightarrow{w} \delta_x$ as $\eta \to 0$.

However, in this article we make no explicit use of the concept of noise
strength; we just mention that concrete calculations are expected to be easier
for small noise strengths and that work on the small noise limit (again using
large deviation methods) has started [45].
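The two-stage picture (a deterministic step followed by independent perturbations) translates directly into code. The sketch below assumes additive Gaussian noise with standard deviation `eta` that does not depend on the unperturbed state; this is only one possible choice of the family $Q_x$, and it ignores the requirement that the perturbed state stay in a compact $M$. Names are hypothetical.

```python
import random

def noisy_step(config, f, kappa, eta, rng):
    """One step of eq. (19) with assumed Gaussian noise of standard deviation eta.

    eta = 0 corresponds to Q_x = delta_x, i.e. the deterministic evolution (8).
    """
    fx = [f(x) for x in config]
    h = sum(fx) / len(config)                      # mean field of the configuration
    unperturbed = [y + kappa * h for y in fx]      # deterministic step, as in (15)
    if eta == 0.0:
        return unperturbed
    # Independent perturbation for every subsystem.
    return [y + rng.gauss(0.0, eta) for y in unperturbed]

f = lambda x: 1.0 - 2.0 * abs(x)                   # assumed local map
config = [0.0, 0.5, 0.25]
deterministic = noisy_step(config, f, 0.2, 0.0, random.Random(1))
perturbed = noisy_step(config, f, 0.2, 0.01, random.Random(1))
```

Using a seeded `random.Random` instance makes individual noise realisations reproducible, which is convenient when comparing runs with different `eta`.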
3 The stochastic process of empirical measures and its thermodynamic limit



The stochastic process of configurations, $(X^{[N]}_t)$, characterised by eq. (18)
induces a stochastic process of empirical measures, $\bigl(\varepsilon^{[N]}_{X^{[N]}_t}\bigr)$, which is again a
Markov process and therefore can be characterised by its one-step transition
probability

$$\mathrm{Prob}\bigl\{\varepsilon^{[N]}_{X^{[N]}_{t+1}} \in B \bigm| \varepsilon^{[N]}_{X^{[N]}_t} = \mu\bigr\}, \tag{21}$$

where $B$ is a Borel subset of $\mathcal{P}(M)$. Note that since $\bigl(\varepsilon^{[N]}_{X^{[N]}_t}\bigr)$ is a process on
$\mathcal{P}(M)$, the transition probabilities are elements of $\mathcal{P}(\mathcal{P}(M))$.

In order to study the $N \to \infty$ behaviour of macroscopic variables it is enough
to study the behaviour of the process of empirical measures in this so-called
thermodynamic limit, and this is what we will do now.



3.1 Deterministic dynamics on $\mathcal{P}(M)$ in the thermodynamic limit



The first observation is that by law-of-large-numbers type arguments the
thermodynamic limit of the empirical measure at some time $t + 1$ can be computed
from that at time $t$.

Theorem 1 Let $(\nu^{[N]})$ be a family of probability measures in $\mathcal{P}(M)$, and
$\nu^{[N]} \xrightarrow[N \to \infty]{w} \nu$. Then the conditional probability that

$$\varepsilon^{[N]}_{X^{[N]}_{t+1}} \xrightarrow[N \to \infty]{w} \mathcal{F}(\nu) \tag{22}$$

given $\varepsilon^{[N]}_{X^{[N]}_t} = \nu^{[N]}$ for all $N = 1, 2, \dots$ is equal to 1, where the map
$\mathcal{F} : \mathcal{P}(M) \to \mathcal{P}(M)$ is defined by

$$(\mathcal{F}(\varphi))(A) = \int_M Q_{F(x, \varphi)}(A)\, \varphi(dx). \tag{23}$$

Theorem 1 means that in the thermodynamic limit the empirical measures
follow a deterministic time evolution as described by the map $\mathcal{F}$ appearing in
eq. (23).
Instead of giving a detailed proof of the theorem, a few remarks will indicate
here which arguments lead to the theorem, which is a harmless generalisation
of a fundamental theorem of statistics, attributed to V. S. Varadarajan in [43].
An essential observation is that

$$\mathrm{Prob}\bigl\{X^{[N](i)}_{t+1} \in A \bigm| \varepsilon^{[N]}_{X^{[N]}_t} = \nu^{[N]}\bigr\} = (\mathcal{F}(\nu^{[N]}))(A). \tag{24}$$

It is then a consequence of the strong law of large numbers that for every
bounded function $g : M \to \mathbb{R}$ the expression

$$\int_M g(y)\, \varepsilon^{[N]}_{X^{[N]}_{t+1}}(dy) = \frac{1}{N} \sum_{i=1}^{N} g\bigl(X^{[N](i)}_{t+1}\bigr) \tag{25}$$

converges to

$$\int_M g(y)\, (\mathcal{F}(\nu))(dy)$$

with conditional probability 1.

The last step which completes the proof of Theorem 1 concerns the fact that
the exceptional null-sets on which the above convergence does not hold true
may depend on the function $g$. But standard topological arguments can be
used to exclude the possibility that all the different null-sets (for all bounded
continuous functions $g$) add up to a set of positive measure, so that
the conditional probability for the weak convergence (22) is indeed 1.

In the case of globally coupled maps without noise, i.e. $Q_x = \delta_x$, eq. (23)
reduces to the nonlinear Frobenius-Perron equation mentioned in the Introduction,
which has been introduced by Kaneko [13] and by Pikovsky and Kurths [21].
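The reduction can be spelled out for the additive coupling (15). The following short derivation is a sketch which assumes that the measures involved have densities, written $\rho_t$ and $\rho_{t+1}$ as in the Introduction.

```latex
% Noise-free case Q_x = \delta_x with F_\kappa(x,\mu) = f_a(x) + \kappa h(\mu), eq. (15).
\begin{align*}
(\mathcal{F}(\varphi))(A)
  &= \int_M \delta_{F_\kappa(x,\varphi)}(A)\, \varphi(dx)
   = \varphi\bigl(\{x \in M : f_a(x) + \kappa\, h(\varphi) \in A\}\bigr). \\
\intertext{If $\varphi$ has the density $\rho_t$ and $\mathcal{F}(\varphi)$ has the density $\rho_{t+1}$, then}
\int_A \rho_{t+1}(y)\, dy
  &= \int_M \mathbf{1}_A\bigl(f_a(x) + \kappa\, h[\rho_t]\bigr)\, \rho_t(x)\, dx \\
  &= \int_A \int_M \delta\bigl(y - f_a(x) - \kappa\, h[\rho_t]\bigr)\, \rho_t(x)\, dx\, dy .
\end{align*}
% This holds for all Borel sets A, which gives eq. (4).
```

In other words, without noise the map $\mathcal{F}$ simply transports the measure $\varphi$ along the map $x \mapsto F_\kappa(x, \varphi)$, whose form itself depends on $\varphi$ through the mean field.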

Many (mostly numerical) studies of the nonlinear Frobenius-Perron equation
and its noisy counterpart show that the map $\mathcal{F}$ often has a very complicated
attractor structure [21,13,14,22,23]. Periodic, quasiperiodic, and chaotic
attractors of the dynamics (23) have been found and discussed in connection
with some counterintuitive observations about the temporal behaviour of
macroscopic variables. Also, multi-stability with the simultaneous existence of
many (possibly infinitely many) attractors is possible.

Not surprisingly, it is the nonlinearity of (23) which opens up the possibility
of a much richer dynamical behaviour than what is known from the linear
Frobenius-Perron equation and its noisy counterpart (which is the limiting
case of (23) in which $F(x, \varphi)$ depends on $x$ only and not on $\varphi$, i.e. the
uncoupled case).



3.2 Large deviations from the deterministic dynamics of empirical measures 
in one time step 



Theorem 1 describes the dynamics of the empirical measures in the
thermodynamic limit deterministically by the map $\mathcal{F}$. By definition, for large but finite
$N$ the temporal evolution of $\bigl(\varepsilon^{[N]}_{X^{[N]}_t}\bigr)$ will be close to that dynamics, at least
on a short time scale. It is very desirable to make this statement more concrete
and quantitative, because then we will be in a position to derive predictions
about the behaviour of large finite systems from the properties of the map $\mathcal{F}$.

In the Introduction we gave an example of how the behaviour of a macroscopic
variable for a large finite system looks like a stochastically perturbed version
of its behaviour in the thermodynamic limit. Here, we trace this observation
back to the level of empirical measures: the behaviour of $\varepsilon^{[N]}_{X^{[N]}_t}$ can be
understood as the action of the map $\mathcal{F}$ plus random perturbations with a strength
that decreases with $\frac{1}{N}$. These random perturbations have the so-called large
deviation property, which is essential for the methods we will apply to study
their influence.

First we recall what it means to say that probability measures have the large 
deviation property [46,47]. 

Consider a family of probability measures $(P^{[\epsilon]})$ on some compact metric space
$S$. This family is said to have the large deviation property with deviation rate
$I$, which is a function $I : S \to [0, \infty]$ with compact level sets $\{s \in S : I(s) \le B\}$
for arbitrary $B \in [0, \infty)$, if for all bounded continuous functions $h : S \to \mathbb{R}$
the following limit holds:

$$\lim_{\epsilon \to 0} \epsilon \log \int_S \exp\Bigl[-\frac{h(s)}{\epsilon}\Bigr]\, P^{[\epsilon]}(ds) = -\inf_{s \in S} \bigl[h(s) + I(s)\bigr]. \tag{26}$$

To be precise, this property is called the Laplace principle [47] because of
its obvious relation to the Laplace approximation (i.e. the "saddle point
approximation" for real-valued integrals), but (at least in our setting) this is
equivalent to the large deviation property in its original formulation.

It is convenient and helpful for intuition to introduce the following symbolic
notation for the essence of (26):

$$P^{[\epsilon]}(ds) \asymp \exp\Bigl[-\frac{I(s)}{\epsilon}\Bigr]\, ds. \tag{27}$$

Thus, the large deviation property can be characterised as a tool for estimating
the probability of rare events: roughly speaking, the sets $A \subset S$ which do not
contain points where the deviation rate vanishes have a probability that
converges to zero exponentially fast as $\epsilon \to 0$, and the quantity $\inf_{s \in A} I(s)$
gives the exponential rate of this convergence.

In the context of random perturbations of dynamical systems, a discrete-time
Markov process $(S^\epsilon_t)$ with values in $S$, parametrised by a perturbation strength
$\epsilon$, is called a random perturbation of a dynamical system $G : S \to S$ with large
deviation property if there is a deviation rate $s \mapsto I(s|r)$ ($s, r \in S$) for the
transition probabilities $\mathrm{Prob}\{S^\epsilon_{t+1} \in \cdot \mid S^\epsilon_t = r\}$ which is lower semi-continuous
in $r$ and has the property that $I(s|r) = 0$ iff $s = G(r)$. The simplest example of
a dynamical system with random perturbations with large deviation property
is a one-dimensional map ($S = \mathbb{R}$) with additive white Gaussian noise with
variance $\epsilon$. In this case, the deviation rate is

$$I(s|r) = \tfrac{1}{2}\, |s - G(r)|^2. \tag{28}$$
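The statement that the infimum of the deviation rate gives the exponential decay rate of rare-event probabilities can be checked numerically in the Gaussian example. The sketch below takes $G(r) = 0$, so that the next state is normally distributed with mean 0 and variance $\epsilon$, and considers the rare event $A = [c, \infty)$, for which $\inf_{s \in A} I(s|r) = c^2/2$. The function name and the chosen values of $\epsilon$ are illustrative assumptions.

```python
import math

def scaled_log_tail(c, eps):
    """eps * log Prob{X >= c} for a Gaussian X with mean 0 and variance eps."""
    p = 0.5 * math.erfc(c / math.sqrt(2.0 * eps))   # exact Gaussian tail probability
    return eps * math.log(p)

c = 1.0
rates = [scaled_log_tail(c, eps) for eps in (0.1, 0.01, 0.002)]
limit = -0.5 * c * c                                # -inf_{s >= c} (1/2) s^2
```

The entries of `rates` increase towards `limit` as `eps` decreases; the remaining gap comes from sub-exponential prefactors, which the large deviation estimate ignores by construction.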



Now we come back to the claim that $\bigl(\varepsilon^{[N]}_{X^{[N]}_t}\bigr)$ is a random perturbation of
$\mathcal{F} : \mathcal{P}(M) \to \mathcal{P}(M)$ with large deviation property. In order to specify the
deviation rate for this process, which has to be a function on $\mathcal{P}(M) \times \mathcal{P}(M)$,
we use a well-known concept for measuring the dissimilarity of two measures
in $\mathcal{P}(M)$, the relative entropy or Kullback-Leibler divergence [48]:

For $\varphi, \psi \in \mathcal{P}(M)$, the relative entropy of $\varphi$ with respect to $\psi$ is defined as

$$H(\varphi|\psi) := \int_M \log\frac{d\varphi}{d\psi}(x)\, \varphi(dx) \tag{29}$$

if $\varphi$ is absolutely continuous with respect to $\psi$, and $H(\varphi|\psi) = \infty$ otherwise.

Note that $H(\varphi|\psi) = 0$ iff $\varphi = \psi$. In information theory the relative entropy
is used as a measure for the information carried by $\varphi$ under the hypothesis $\psi$.
It can be used loosely as a way to characterise how "far away" $\varphi$ is from $\psi$ in
$\mathcal{P}(M)$, although the relative entropy does not fulfil the formal definition of a
distance (as the triangle inequality does not hold even after symmetrisation).
Technically, the relative entropy is a lower semi-continuous function in its
second argument and has compact level sets as a function of its first argument.
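For a finite state space the relative entropy becomes a finite sum, which makes its basic properties easy to inspect. The following sketch assumes that both measures are given as probability vectors over the same finite set of states; the function name is hypothetical.

```python
import math

def relative_entropy(phi, psi):
    """H(phi|psi) for two probability vectors on the same finite state space.

    Returns infinity if phi is not absolutely continuous with respect to psi,
    i.e. if phi gives weight to a state that has zero weight under psi.
    """
    total = 0.0
    for p, q in zip(phi, psi):
        if p > 0.0:
            if q == 0.0:
                return math.inf
            total += p * math.log(p / q)
    return total

h_same = relative_entropy([0.3, 0.7], [0.3, 0.7])
h_forward = relative_entropy([0.3, 0.7], [0.5, 0.5])
h_backward = relative_entropy([0.5, 0.5], [0.3, 0.7])
h_singular = relative_entropy([0.5, 0.5], [1.0, 0.0])
```

The values illustrate the properties quoted above: `h_same` vanishes, `h_forward` and `h_backward` differ (no symmetry), and `h_singular` is infinite.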
As we will see now, for our setting with $S = \mathcal{P}(M)$ it plays the same role in the
deviation rate as the square of the Euclidean distance in equation (28) (Gaussian
noise and $S = \mathbb{R}$), which brings us to the central point of this subsection:

Theorem 2 The process $\bigl(\varepsilon^{[N]}_{X^{[N]}_t}\bigr)$ is a random perturbation of
$\mathcal{F} : \mathcal{P}(M) \to \mathcal{P}(M)$ with large deviation property. The noise strength is
$\frac{1}{N}$, and the deviation rate is

$$I(\psi|\nu) = H(\psi|\mathcal{F}(\nu)). \tag{30}$$

This theorem is a close relative of one of the classical results of large deviation
theory, the so-called Sanov theorem [46]. In fact, it is a special case of a
generalised Sanov theorem proved by Dawson and Gärtner [42]. However, in the
Appendix we will sketch a direct proof of Theorem 2, which requires fewer
topological prerequisites than the approach of Dawson and Gärtner.

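In the symbolic notation of equation (27), the content of Theorem 2 reads:

$$\mathrm{Prob}\bigl\{\varepsilon^{[N]}_{X^{[N]}_{t+1}} \in d\psi \bigm| \varepsilon^{[N]}_{X^{[N]}_t} = \nu^{[N]}\bigr\} \asymp \exp\bigl[-N\, H(\psi|\mathcal{F}(\nu))\bigr]\, d\psi, \tag{31}$$

where $\nu = \text{w-}\lim_{N \to \infty} \nu^{[N]}$.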
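The exponential estimate can be checked by hand in the simplest conceivable situation, which is an assumption made purely for illustration: a two-state space $M = \{0, 1\}$ and uncoupled subsystems that land in state 1 with a fixed probability $q$, so that $\mathcal{F}(\nu) = (1 - q, q)$ for every $\nu$. The empirical measure is then determined by the fraction of subsystems in state 1, which follows a binomial distribution. Function names are hypothetical.

```python
import math

def log_binomial_pmf(n, k, q):
    """log Prob{k of n independent subsystems are in state 1}, each with probability q."""
    log_comb = math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
    return log_comb + k * math.log(q) + (n - k) * math.log(1.0 - q)

def relative_entropy_two_state(p, q):
    """H((1-p, p) | (1-q, q)) for 0 < p, q < 1."""
    return p * math.log(p / q) + (1.0 - p) * math.log((1.0 - p) / (1.0 - q))

def empirical_rate(n, p, q):
    """-(1/n) log Prob{the empirical fraction of state 1 equals p}."""
    k = round(n * p)
    return -log_binomial_pmf(n, k, q) / n

target = relative_entropy_two_state(0.3, 0.5)
gap_small_n = abs(empirical_rate(200, 0.3, 0.5) - target)
gap_large_n = abs(empirical_rate(2000, 0.3, 0.5) - target)
```

The gap between the exact decay rate and the relative entropy shrinks as the number of subsystems grows, as expected from an estimate that is only valid on a logarithmic scale.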



3.3 Large deviations from the deterministic dynamics of empirical measures
in the long term



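Theorem 2 gives an asymptotic estimate for the one-step transition probability

$$\mathrm{Prob}\bigl\{\varepsilon^{[N]}_{X^{[N]}_{t+1}} \in B \bigm| \varepsilon^{[N]}_{X^{[N]}_t} = \mu\bigr\}$$

of the process of empirical measures $\bigl(\varepsilon^{[N]}_{X^{[N]}_t}\bigr)$ (with $B$ a Borel subset of $\mathcal{P}(M)$).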

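The best way to understand the long-term behaviour of this process is to
study invariant measures $W^{[N]} \in \mathcal{P}(\mathcal{P}(M))$ of the process, which are implicitly
defined by

$$W^{[N]}(B) = \int_{\mathcal{P}(M)} \mathrm{Prob}\bigl\{\varepsilon^{[N]}_{X^{[N]}_{t+1}} \in B \bigm| \varepsilon^{[N]}_{X^{[N]}_t} = \mu\bigr\}\, W^{[N]}(d\mu) \tag{32}$$

for all Borel subsets $B$ of $\mathcal{P}(M)$.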
An invariant measure $W^{[N]}$ gives highest weight to those areas of $\mathcal{P}(M)$ where
the empirical measure of the system's configuration can most likely be found
in the long term.

From Theorem 2 we know that for large $N$ the process $\bigl(\varepsilon^{[N]}_{X^{[N]}_t}\bigr)$ behaves like
a small random perturbation of the deterministic map $\mathcal{F}$, and therefore we
expect that invariant measures $W^{[N]}$ are concentrated near the attractors of
$\mathcal{F}$. It is reasonable to assume that the family $(W^{[N]})$ itself has a large
deviation property.

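Theorem 3 If a family $(W^{[N]})$ of invariant measures of the processes $\bigl(\varepsilon^{[N]}_{X^{[N]}_t}\bigr)$
has the large deviation property with noise strength $\frac{1}{N}$ and deviation rate
$\Phi : \mathcal{P}(M) \to [0, \infty]$, $\varphi \mapsto \Phi(\varphi)$ (the so-called quasipotential), then

$$\Phi(\psi) = \inf_{\varphi \in \mathcal{P}(M)} \bigl[H(\psi|\mathcal{F}(\varphi)) + \Phi(\varphi)\bigr]. \tag{33}$$

Formally, equation (33) can be interpreted as the result of evaluating the
exponential asymptotics of equation (32), namely

$$\exp[-N \Phi(\psi)] \asymp \int_{\mathcal{P}(M)} d\varphi\, \exp\bigl[-N H(\psi|\mathcal{F}(\varphi))\bigr]\, \exp[-N \Phi(\varphi)], \tag{34}$$

by a Laplace approximation of the integral.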
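The structure of equation (33) can be illustrated in the one-dimensional Gaussian setting of equation (28), where the relative entropy is replaced by half the squared distance. The following worked example is an analogue chosen for illustration, not a statement about the space $\mathcal{P}(M)$; it assumes a linear map $G(r) = \lambda r$ with $|\lambda| < 1$ and a quadratic ansatz for the quasipotential.

```latex
% One-dimensional analogue of eq. (33): S = R, G(r) = \lambda r, |\lambda| < 1,
% deviation rate I(s|r) = (1/2)|s - \lambda r|^2 as in eq. (28).
\begin{align*}
\Phi(s) &= \inf_{r \in \mathbb{R}} \Bigl[ \tfrac12 (s - \lambda r)^2 + \Phi(r) \Bigr],
  \qquad \text{ansatz } \Phi(r) = c\, r^2, \; c > 0, \\
0 &= -\lambda (s - \lambda r) + 2 c\, r
  \;\Longrightarrow\; r^* = \frac{\lambda s}{\lambda^2 + 2c}, \\
\tfrac12 (s - \lambda r^*)^2 + c\,(r^*)^2
  &= \frac{c\, s^2}{\lambda^2 + 2c} \overset{!}{=} c\, s^2
  \;\Longrightarrow\; c = \tfrac12 (1 - \lambda^2), \\
\Phi(s) &= \tfrac12 (1 - \lambda^2)\, s^2 .
\end{align*}
```

This agrees with a direct calculation: the process $s_{t+1} = \lambda s_t + \sqrt{\epsilon}\, \xi_t$ with standard Gaussian $\xi_t$ has a Gaussian invariant measure with variance $\epsilon / (1 - \lambda^2)$, i.e. a density proportional to $\exp[-\Phi(s)/\epsilon]$. Note that $\Phi \equiv 0$ also satisfies the fixed-point equation, so the equation alone does not single out the quasipotential.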

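Theorem 3 can be proved as follows: since the deviation rate of a family of
probability measures with large deviation property is uniquely determined, it is
sufficient to show that for all bounded continuous functions $h : \mathcal{P}(M) \to \mathbb{R}$
it is true that

$$-\inf_{\psi \in \mathcal{P}(M)} \Bigl[h(\psi) + \inf_{\varphi \in \mathcal{P}(M)} \bigl[H(\psi|\mathcal{F}(\varphi)) + \Phi(\varphi)\bigr]\Bigr] = \lim_{N \to \infty} \frac{1}{N} \log \int_{\mathcal{P}(M)} \exp[-N h(\psi)]\, W^{[N]}(d\psi). \tag{35}$$

But the left hand side of equation (35) can be written as

$$-\inf_{\varphi \in \mathcal{P}(M)} \Bigl[\Phi(\varphi) + \inf_{\psi \in \mathcal{P}(M)} \bigl[H(\psi|\mathcal{F}(\varphi)) + h(\psi)\bigr]\Bigr],$$

and an application of Theorem 2 shows that this is equal to

$$-\lim_{N \to \infty} \inf_{\varphi \in \mathcal{P}(M)} \Bigl[\Phi(\varphi) - \frac{1}{N} \log \int_{\mathcal{P}(M)} e^{-N h(\psi)}\, \mathrm{Prob}\bigl\{\varepsilon^{[N]}_{X^{[N]}_{t+1}} \in d\psi \bigm| \varepsilon^{[N]}_{X^{[N]}_t} = \varphi\bigr\}\Bigr].$$

Since $\Phi$ is the deviation rate for $(W^{[N]})$, equation (26) can be used to rewrite
this last expression as

$$\lim_{N \to \infty} \frac{1}{N} \log \int_{\mathcal{P}(M)} \exp\Bigl[N\, \frac{1}{N} \log \int_{\mathcal{P}(M)} e^{-N h(\psi)}\, \mathrm{Prob}\bigl\{\varepsilon^{[N]}_{X^{[N]}_{t+1}} \in d\psi \bigm| \varepsilon^{[N]}_{X^{[N]}_t} = \varphi\bigr\}\Bigr]\, W^{[N]}(d\varphi)$$

$$= \lim_{N \to \infty} \frac{1}{N} \log \int_{\mathcal{P}(M)} \exp[-N h(\psi)]\, W^{[N]}(d\psi),$$

where in the last transformation we used equation (32). This completes the
proof of Theorem 3.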

If we are able to find the quasipotential, i.e. the deviation rate for invariant
measures of the process of empirical measures, then we have all we need to know
for exponential estimates of the long-term behaviour of empirical measures and
thereby of the long-term behaviour of all global variables. We will come back
to this point in the next subsection.

Theorem 3 gives the key information on how to find quasipotentials: equation
(33) is an eigenvalue equation in a generalised sense, and there exists a
complete theory about how to find solutions to such equations. A very brief
summary of the relevant facts from that theory [49,50] will be given below.

The concept of a quasipotential for an asymptotic description of the long term 
behaviour of stochastic perturbations of dynamical systems was introduced 
(under various names) by several authors in the context of low-dimensional 
diffusion processes with a small noise term. Early work on quasipotentials
published in the physics literature was inspired by an analogy with the semiclassical
approximation in the path integral formalism of quantum mechanics, and by
the fact that in the weak noise limit the Fokker-Planck equation can be linked
to a Hamilton-Jacobi equation; for references see [33]. A rigorous mathematical
formulation of the quasipotential method for these continuous-time situations
was given by Wentzell and Freidlin [34]. Later, a version of the quasipotential
method for discrete-time systems was formulated [51,52], and this is what
we need here.

The usual Wentzell-Freidlin approach to quasipotentials does not start from
the assumption that there is a family of invariant measures with the large
deviation property (as Theorem 3 does) but rather shows its existence. However,
the conditions used in the arguments of references [51,52] are not strictly
fulfilled in the present case. Instead of going into the details of these arguments
again with an adjusted set of preconditions, we pragmatically content ourselves
here with showing how to find a quasipotential if it exists, which can be done
by studying the algebraic consequences of Theorem 3. The proofs of the
following statements can be found in [49].

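Define the least n-action (in [49] called the n-step transition pseudodensity)
between $\chi_0 \in \mathcal{P}(M)$ and $\chi_n \in \mathcal{P}(M)$ as

$$S_n(\chi_n \mid \chi_0) := \inf_{\chi_1, \ldots, \chi_{n-1}} \sum_{i=1}^{n} H(\chi_i \mid \mathcal{F}(\chi_{i-1})) \tag{36}$$

and the least action (in [49] called the transitive closure of the transition
pseudodensity) between $\varphi \in \mathcal{P}(M)$ and $\psi \in \mathcal{P}(M)$ as

$$S(\psi \mid \varphi) := \inf_{n \geq 1} S_n(\psi \mid \varphi). \tag{37}$$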



The name action for a sum of one-step deviation rates like in (36) alludes to 
the formal analogy to semi-classics and hints at the methods which we will 
apply for the calculations in the next section. 
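
On a finite set of states the definitions (36) and (37) reduce to min-plus matrix products, which makes it easy to see what is being minimised. The following sketch assumes a hypothetical cost matrix `cost[i, k]` standing in for the one-step deviation rate of a step from state `i` to state `k`; the function names and the three-state example are illustrative only, and the infimum over $n \geq 1$ in (37) is truncated at `n_max`.

```python
import numpy as np

def least_n_actions(cost, n_max):
    """Return [S_1, ..., S_{n_max}], where S_n[i, k] is the least n-action
    for paths of n steps from state i to state k (cf. (36))."""
    actions = [cost.copy()]
    for _ in range(n_max - 1):
        prev = actions[-1]
        # Min-plus product: minimise over the intermediate state l.
        actions.append(np.min(prev[:, :, None] + cost[None, :, :], axis=1))
    return actions

def least_action(cost, n_max):
    """Least action (37), with the infimum over n truncated at n_max."""
    return np.min(np.stack(least_n_actions(cost, n_max)), axis=0)

# Hypothetical one-step costs between three states.
cost = np.array([[0.0, 1.0, 5.0],
                 [2.0, 0.0, 1.0],
                 [9.0, 9.0, 0.0]])
S = least_n_actions(cost, 3)
S_total = least_action(cost, 3)
```

In this example the direct step from state 0 to state 2 costs 5, whereas the two-step path through state 1 costs 2, so the least action is attained for $n = 2$.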

Least actions are the crucial elements in the construction of solutions of
equation (33):

First define the set $R_{\mathcal{F}} \subset \mathcal{P}(M)$ as the set of measures $\varphi$ which have the
following properties:

(1) $S(\varphi \mid \varphi) = 0$.

(2) If any measure $\psi \in \mathcal{P}(M)$ fulfils $S(\psi \mid \varphi) = 0$, it is also true that
$S(\varphi \mid \psi) = 0$.

The set $R_{\mathcal{F}}$ is related to and in many cases identical to the union of attractors
of the map $\mathcal{F}$; for details about this relation see [49].

Every solution of (33) can be written in the following form:

$$\Phi(\psi) = \inf_{a \in R_{\mathcal{F}}} \big[ S(\psi \mid a) + c(a) \big], \tag{38}$$

where the coefficients $c(a)$ are free parameters if one is only interested in solving
(33). But since the quasipotential is the deviation rate for the invariant
measures, the coefficients have to fulfil some additional equations that guarantee
compatibility with (32), and these equations fix the values $c(a)$ up to an
additive constant which can be set to zero. A graph theoretical method for
determining the values $c(a)$ when the least actions $S(\psi \mid a)$ are known for all
$a \in R_{\mathcal{F}}$ goes back to Wentzell and Freidlin [34]. We will not explain the details
of this procedure, for which we refer to [34,51,52]: in a general context it
is enough to know that the coefficients can be determined in principle, while
the concrete example which we will study at the end of the next section has
a map $\mathcal{F}$ for which the only attractor is a stable fixed point $a^*$. In this case
we obviously have

$$\Phi(\psi) = S(\psi \mid a^*). \tag{39}$$



3.4 Large deviations from the deterministic dynamics of global variables in
the long term



The quasipotential $\Phi$ describes the long term behaviour of empirical measures
as $N$ approaches infinity, or more precisely the probability for finding certain
empirical measures in the long time development of the system. From
a practical point of view, however, it is cumbersome to observe probability
distributions on the infinite dimensional space $\mathcal{P}(M)$ in large coupled systems.
Instead, it is usually more convenient to follow the time development of
(low dimensional) global variables, i.e. variables that depend on the empirical
measure through a continuous map

$$G : \mathcal{P}(M) \to \mathbb{R}^k. \tag{40}$$

We have mentioned already two examples: mean value (12) and mean field (14).

The invariant measure $W^{[N]}$ of the process of empirical measures fixes a measure
$W_G^{[N]}$ on $G(\mathcal{P}(M)) \subset \mathbb{R}^k$:

$$W_G^{[N]}(B) = W^{[N]}(\{\varphi \in \mathcal{P}(M) : G(\varphi) \in B\}) \tag{41}$$

for all Borel subsets $B$ of $G(\mathcal{P}(M))$. $W_G^{[N]}$ describes the long term behaviour
of the global variable $G$.

The fact that $(W^{[N]})$ has the large deviation property implies a large deviation
property for $(W_G^{[N]})$, too, and the deviation rate can be found through
a theorem which is known as the contraction principle in large deviation theory
[46,47]:

Theorem 4 The deviation rate of the family $(W_G^{[N]})$ of probability measures
describing the long term statistics of the global variable $G$, which we call
contracted quasipotential $\phi_G$, can be derived in the following way from the
quasipotential $\Phi$:

$$\phi_G(u) = \inf_{\varphi : G(\varphi) = u} \Phi(\varphi) \tag{42}$$

for all $u \in G(\mathcal{P}(M))$.

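The proof of this theorem consists in the simple observation that the large
deviation property (26) for $(W^{[N]})$ implies that for all bounded continuous
functions $h_G : G(\mathcal{P}(M)) \to \mathbb{R}$ one has:

$$\lim_{N \to \infty} \frac{1}{N} \log \int_{\mathcal{P}(M)} \exp[-N h_G(G(\psi))] \, W^{[N]}(d\psi)
= -\inf_{\psi \in \mathcal{P}(M)} \big[ h_G(G(\psi)) + \Phi(\psi) \big]. \tag{43}$$

But the left hand side of equation (43) is equal to

$$\lim_{N \to \infty} \frac{1}{N} \log \int_{G(\mathcal{P}(M))} \exp[-N h_G(u)] \, W_G^{[N]}(du),$$

whereas the right hand side is equal to

$$-\inf_{u \in G(\mathcal{P}(M))} \Big[ h_G(u) + \inf_{G(\psi) = u} \Phi(\psi) \Big],$$

and this shows that the statement of Theorem 4 is true.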
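
The contraction (42) is a constrained minimisation, which is easy to carry out explicitly in a toy setting. The sketch below replaces $\mathcal{P}(M)$ by the plane and uses a hypothetical quadratic `Phi` together with the hypothetical global variable $G(x, y) = x + y$; none of these choices comes from the model studied in this article.

```python
import numpy as np

def Phi(x, y):
    # Hypothetical stand-in for a quasipotential on a two-dimensional state space.
    return x ** 2 + 2.0 * y ** 2

def contracted(u, xs):
    """Minimise Phi over the set {G = u} for G(x, y) = x + y, i.e. over y = u - x."""
    return np.min(Phi(xs, u - xs))

xs = np.linspace(-3.0, 3.0, 60001)
phi_G = contracted(1.0, xs)   # analytic value for this toy case: 2 u^2 / 3 with u = 1
```

For this quadratic example the minimum over the constraint set is attained at $x = 2u/3$, which gives the contracted function $2u^2/3$; the grid minimisation reproduces this value.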

Sometimes it is possible to recover part of the dynamical information lost
during the 'projection' of elements of the infinite dimensional space $\mathcal{P}(M)$ to
$k$-dimensional values of the global variable by studying a time delay plot as
mentioned in the Introduction. For instance, a two dimensional time delay plot
of a one dimensional global variable $G$ would mean that for some realization
of the process $(\varepsilon^{[N]}_t)$ one plots the points

$$\big( G(\varepsilon^{[N]}_t), \, G(\varepsilon^{[N]}_{t+1}) \big)$$

in a two dimensional plane.

In the long term, the time delay plot of $G$ is expected to show the set

$$\{ (u, v) \in \mathbb{R} \times \mathbb{R} \mid u = G(\varphi), \ v = G(\mathcal{F}(\varphi)), \ \varphi \in R_{\mathcal{F},i} \} \tag{44}$$

in the thermodynamic limit, where $R_{\mathcal{F},i}$ is one of the attractors of $\mathcal{F}$.

For finite $N$, however, the points in the time delay plot will be distributed
according to a probability measure $W_{G,2}^{[N]}$ on $\mathbb{R} \times \mathbb{R}$. Again, the measures $(W_{G,2}^{[N]})$
have the large deviation property, and it follows from Theorem 2, Theorem
3, and equation (26) that the corresponding deviation rate, the time delay
contracted quasipotential $\phi_{G,2}$, can be calculated from the quasipotential $\Phi$ in
the following way:

$$\phi_{G,2}(u, v) = \inf_{G(\varphi) = u, \, G(\psi) = v} \big[ \Phi(\varphi) + H(\psi \mid \mathcal{F}(\varphi)) \big]. \tag{45}$$

This time delay contracted quasipotential predicts the outcome of time delay
plots for large but finite $N$. Its minimum or minima coincide with the set (44),
and its level cuts (contour lines) in heights proportional to $1/N$ determine the
regions of the plane in which points of the time delay plot are to be expected
when $N$ is finite.

In the example of Fig. 1, the time delay contracted quasipotential is expected
to have a ring-shaped valley along the thermodynamic limit attractor and a 
hill in the middle of that ring, the height of which is proportional to the critical 
number of subsystems which is necessary in order to resolve the ring structure 
in a simulation. 



4 Approximative computation of contracted quasipotentials 



In the preceding section we have built the theoretical foundation for a large
deviation type description of the $N \to \infty$ asymptotics for the long term
behaviour of globally coupled maps. The central object in this description is the
quasipotential; however, its computation is a complex and difficult minimisation
problem over paths (i.e., sequences) in the infinite dimensional space
$\mathcal{P}(M)$ of probability measures. This is a much more difficult task than the
minimisation problem that one has to solve in order to compute quasipotentials
for low dimensional maps. Nevertheless, a Hamiltonian approach to the
minimisation problem, which has proved to be successful in the low-dimensional
context, can be useful for the present situation, too. However, in order to arrive
at practical algorithms we do not simply copy the low dimensional procedure
to treat the too complicated infinite-dimensional problem of computing $\Phi(\varphi)$,
but we describe a way to approximate computations of contracted
quasipotentials $\phi_G(u)$, which are defined on a low-dimensional space.



4.1 A Hamiltonian formalism for minimising actions



For the following we assume that the map $\mathcal{F}$ has only one attractor and that
$\chi_0$ is a point of this attractor. This simplifies the notation, and the general
case of more than one attractor is complicated only by the fact that several
pieces of 'local' quasipotentials have to be stitched together as in equation
(38).

We concentrate now on the problem of how to compute the contracted
quasipotential $\phi_G$ of a global variable $G$. According to equations (42), (39), (37), and
(36) we have the following multiple minimisation problem:

$$\phi_G(u) = \inf_{n \geq 1} \ \inf_{G(\chi_n) = u} \ \inf_{\chi_1, \ldots, \chi_{n-1}} \sum_{j=1}^{n} H(\chi_j \mid \mathcal{F}(\chi_{j-1})), \tag{46}$$

where all $\chi_j$ are elements of $\mathcal{P}(M)$.

It will turn out to be useful to introduce in an intermediate step constraints
not only on the value of the global variable in the last state $\chi_n$ but in all steps
of the minimising sequence $(\chi_j)$. Defining for $u_1, \ldots, u_n \in \mathbb{R}^k$ the constrained
n-action

$$S_{G,n}(u_1, \ldots, u_n \mid \chi_0) := \inf_{G(\chi_1) = u_1, \ldots, G(\chi_n) = u_n} \sum_{j=1}^{n} H(\chi_j \mid \mathcal{F}(\chi_{j-1})), \tag{47}$$

we can rewrite equation (46) as

$$\phi_G(u) = \inf_{n \geq 1} \ \inf_{u_1, \ldots, u_{n-1}} S_{G,n}(u_1, \ldots, u_{n-1}, u \mid \chi_0). \tag{48}$$



The advantage of the introduction of constrained n-actions is that the
minimisation problem of equation (48) looks very much like the problem of
determining the quasipotentials of a low-dimensional (i.e., $k$-dimensional) map,
the only difference being that the n-action of a low-dimensional map has been
replaced by a constrained n-action derived from an infinite dimensional map.

This shows that the main new technical problem which we face is the computation
of constrained n-actions (47), while the step (48) can then be solved
by standard methods.

Equation (47), which defines the constrained n-actions, can be reformulated
as a Lagrangian minimisation problem; we do this here under the assumption
that $M$ is a compact interval in $\mathbb{R}$ and that the probability measure $\chi_0$ is
absolutely continuous with respect to the Lebesgue measure on $M$, so that $\chi_0$
has a probability density $\rho_0$.

In this case we can write

$$S_{G,n}(u_1, \ldots, u_n \mid \chi_0) = \min_{\rho_1, \ldots, \rho_n} \sum_{j=1}^{n} \mathcal{L}_{u_j, \lambda_j, \mu_j}[\rho_{j-1}, \rho_j - \rho_{j-1}], \tag{49}$$

where $\rho_1, \ldots, \rho_n$ are integrable functions, and a Lagrange functional

$$\mathcal{L}_{u,\lambda,\mu}[\rho, \nu] := \int_M \{\rho(x) + \nu(x)\} \log \frac{\rho(x) + \nu(x)}{\mathcal{F}[\rho](x)} \, dx
+ (\lambda - 1) \Big[ \int_M \{\rho(x) + \nu(x)\} \, dx - 1 \Big]
+ \mu \, \big( G[\rho + \nu] - u \big) \tag{50}$$

has been introduced which contains Lagrange parameters $(\lambda - 1)$ and $\mu$. As
usual all Lagrange parameters $\lambda_j$ and $\mu_j$ will finally be fixed so that the
constraints

$$\int_M \rho_j(x) \, dx = 1 \tag{51}$$

and

$$G[\rho_j] = u_j \tag{52}$$

are fulfilled. Note that in equations (50) and (52) the symbols $G$ and $\mathcal{F}$,
which originally denoted maps on $\mathcal{P}(M)$, are now used for the corresponding
functionals on the space of densities.

In order to find a minimising sequence of densities $\rho_1, \ldots, \rho_n$ that solves
the minimisation problem (49) one could start to write down the variational
derivatives $\delta / \delta\rho_j(x)$ of the sum in (49), which have to be equal to zero for the
minimising sequence. In this way one would be led to an implicit relation between
$\rho_{j+1}$, $\rho_j$, and $\rho_{j-1}$, which is a discrete-time version of a Lagrange equation.
However, this relation cannot simply be turned into an explicit iterative rule
for proceeding from $\rho_{j-1}$ and $\rho_j$ to $\rho_{j+1}$ in a minimising sequence.

The situation is easier to analyse in a Hamiltonian formulation which can
be derived from the Lagrangian one if the global variable has the following
structure

$$G[\rho] = \int_M \gamma(x) \rho(x) \, dx, \tag{53}$$

i.e., if it is obtained by averaging some integrable function $\gamma(x)$. This is the
case for the examples: mean value (12) with $\gamma(x) = x$, and mean field (14)
with $\gamma(x) = f(x)$.

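By Legendre transforming (50) one arrives at the Hamiltonian

$$\mathcal{H}_{u,\lambda,\mu}[\rho, \pi] = \int_M \big( \mathcal{F}[\rho] \big)(y) \, e^{\pi(y) - \lambda - \mu\gamma(y)} \, dy
- \int_M \pi(y) \rho(y) \, dy + (\lambda - 1) + \mu u. \tag{54}$$

Here, a new 'momentum field' $\pi(x)$ has been introduced, so that now the
minimising sequence $(\rho_j)$ is searched for together with a sequence $(\pi_j)$. This
will be done by using discrete time Hamilton equations:

$$\rho_{j+1}(x) - \rho_j(x) = \frac{\delta}{\delta\pi(x)} \mathcal{H}_{u_{j+1},\lambda_{j+1},\mu_{j+1}}[\rho, \pi] \Big|_{\rho = \rho_j, \, \pi = \pi_{j+1}}, \tag{55}$$

$$\pi_{j+1}(x) - \pi_j(x) = -\frac{\delta}{\delta\rho(x)} \mathcal{H}_{u_{j+1},\lambda_{j+1},\mu_{j+1}}[\rho, \pi] \Big|_{\rho = \rho_j, \, \pi = \pi_{j+1}}, \tag{56}$$

which leads to:

$$\rho_{j+1}(x) = \mathcal{F}[\rho_j](x) \, e^{\pi_{j+1}(x) - \lambda_{j+1} - \mu_{j+1}\gamma(x)}, \tag{57}$$

$$\pi_j(x) = \int_M e^{\pi_{j+1}(y) - \lambda_{j+1} - \mu_{j+1}\gamma(y)} \, \frac{\delta \mathcal{F}[\rho_j](y)}{\delta\rho_j(x)} \, dy. \tag{58}$$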

Note that a deterministic orbit of densities $\rho_0, \mathcal{F}[\rho_0], \mathcal{F}[\mathcal{F}[\rho_0]], \ldots$, which is
a trivial minimising sequence, is a solution of equations (57) and (58) with
$\pi_{j+1} = 1$, $\lambda_{j+1} = 1$, and $\mu_{j+1} = 0$ for all $j$.

Still, equations (57) and (58) cannot be used for an explicit forward iteration
which would generate nontrivial minimising sequences $(\rho_j)$ and $(\pi_j)$, since (58)
cannot be inverted to give $\pi_{j+1}$ when knowing $\pi_j$ and $\rho_j$.

Nevertheless, equation (57) provides important insight into the structure of the
problem. First of all, this equation shows that the one-step action $H(\rho_{j+1} \mid \mathcal{F}[\rho_j])$
has the following form:

$$H(\rho_{j+1} \mid \mathcal{F}[\rho_j]) = \int_M \rho_{j+1}(x) \pi_{j+1}(x) \, dx - \lambda_{j+1} - \mu_{j+1} u_{j+1}. \tag{59}$$

Secondly, we can use (57) in the constraints (51) and (52) in order to see that
the Lagrange parameters are determined by the following function:

$$Z_\pi(\rho, \mu) := \int_M \mathcal{F}[\rho](x) \, e^{\pi(x) - \mu\gamma(x)} \, dx : \tag{60}$$

$\mu_{j+1}$ is implicitly given by

$$u_{j+1} = -\frac{\partial}{\partial\mu} \log Z_{\pi_{j+1}}(\rho_j, \mu) \Big|_{\mu = \mu_{j+1}}, \tag{61}$$

while

$$\lambda_{j+1} = \log Z_{\pi_{j+1}}(\rho_j, \mu_{j+1}). \tag{62}$$

We note in passing that equation (58), too, can be expressed in terms of
$\log Z_\pi(\rho, \mu)$,

$$\pi_j(x) = \frac{\delta}{\delta\rho(x)} \log Z_{\pi_{j+1}}(\rho, \mu_{j+1}) \Big|_{\rho = \rho_j}, \tag{63}$$

and that equations (61), (62), and (63) in equation (59) lead to an expression
for the constrained n-action (47) which involves the Legendre transform
$Q_{\pi'}(\pi, u)$ of the function $\log Z_{\pi'}(\rho, \mu)$:

$$S_{G,n}(u_1, \ldots, u_n \mid \chi_0) = \sum_{j=1}^{n-1} Q_{\pi_{j+1}}(\pi_j, u_{j+1})
- \log Z_{\pi_1}(\rho_0, \mu_1) - \mu_1 u_1 + \int_M \rho_n(x) \pi_n(x) \, dx. \tag{64}$$



This remark shows that the present formalism can be interpreted as a generalisation
of the classic Cramér theorem, which says that the deviation rate
of the probability laws of mean values of independent identically distributed
random variables is the Legendre transform of the logarithmic moment generating
function of those random variables. However, on the level of practical
calculations, expression (64) does not offer any additional help.
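
For a density on a grid, the tilting structure of equations (57), (61), (62) and (59) can be checked directly. The sketch below is a minimal illustration under assumptions that are not part of the derivation above: the momentum field is set to the constant $\pi = 1$, the global variable is the mean value ($\gamma(x) = x$), the reference density (playing the role of $\mathcal{F}[\rho_j]$) is a Gaussian with hypothetical parameters, and $\mu$ is found by plain bisection. All function and variable names are hypothetical.

```python
import numpy as np

def tilt_to_mean(x, ref, u, pi=1.0, lo=-20.0, hi=20.0, iters=100):
    """Tilt the density `ref` as in (57) so that its mean equals u.

    Returns the tilted density and the Lagrange parameters (lam, mu)."""
    dx = x[1] - x[0]

    def tilted_mean(mu):
        w = ref * np.exp(pi - mu * x)
        return np.sum(x * w) / np.sum(w)

    # The tilted mean decreases monotonically in mu, so bisection solves (61).
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if tilted_mean(mid) > u:
            lo = mid
        else:
            hi = mid
    mu = 0.5 * (lo + hi)
    lam = np.log(np.sum(ref * np.exp(pi - mu * x)) * dx)   # equation (62)
    rho = ref * np.exp(pi - lam - mu * x)                  # equation (57)
    return rho, lam, mu

x = np.linspace(-10.0, 10.0, 4001)
dx = x[1] - x[0]
m0, v = 0.2, 0.5                                           # hypothetical reference Gaussian
ref = np.exp(-(x - m0) ** 2 / (2 * v)) / np.sqrt(2 * np.pi * v)
u = 0.7                                                    # prescribed mean value
rho, lam, mu = tilt_to_mean(x, ref, u)

relative_entropy = np.sum(rho * np.log(rho / ref)) * dx
one_step_action = np.sum(rho) * dx - lam - mu * u          # equation (59) with pi = 1
```

For a Gaussian reference density the tilt only shifts the mean, so the two numbers `relative_entropy` and `one_step_action` agree with the closed form $(u - m_0)^2 / (2v)$, which is the Cramér rate for Gaussian variables.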

For any more explicit progress in evaluating the constrained n-action (47) it is
necessary to resort to some approximations. First of all we expand $\log Z_{\pi_{j+1}}(\rho_j, \mu)$
in powers of $\mu$. Retaining only terms up to second order in $\mu$, we are led to
approximate expressions for $\lambda_{j+1}$ and $\mu_{j+1}$. With the abbreviations

$$n'_j := \int_M \mathcal{F}[\rho_j](x) \, e^{\pi_{j+1}(x)} \, dx, \tag{65}$$

$$u'_j := \frac{1}{n'_j} \int_M \gamma(x) \mathcal{F}[\rho_j](x) \, e^{\pi_{j+1}(x)} \, dx, \tag{66}$$

$$v'_j := \frac{1}{n'_j} \int_M (\gamma(x))^2 \mathcal{F}[\rho_j](x) \, e^{\pi_{j+1}(x)} \, dx - (u'_j)^2, \tag{67}$$

we obtain

$$\log Z_{\pi_{j+1}}(\rho_j, \mu) \approx \log n'_j - \mu u'_j + \frac{1}{2} \mu^2 v'_j. \tag{68}$$



From equation (61) we see now that

$$\mu_{j+1} \approx -\frac{u_{j+1} - u'_j}{v'_j}, \tag{69}$$

which means that the present approximation makes sense as long as $u_{j+1}$ stays
close to $u'_j$, and from equation (62) that

$$\lambda_{j+1} \approx \log n'_j + \frac{u_{j+1} - u'_j}{v'_j} \Big( \frac{1}{2} u_{j+1} + \frac{1}{2} u'_j \Big). \tag{70}$$



The corresponding approximation for the one-step action (59) is

$$H(\rho_{j+1} \mid \mathcal{F}[\rho_j]) \approx \int_M \rho_{j+1}(x) \pi_{j+1}(x) \, dx - \log n'_j + \frac{1}{2} \frac{(u_{j+1} - u'_j)^2}{v'_j}. \tag{71}$$
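
The quality of the second-order expansion (68) and of the resulting estimate (69) is easy to probe numerically. The sketch below assumes, purely for illustration, a uniform reference density on $[0, 1]$ in the role of $\mathcal{F}[\rho_j]$, the constant momentum field $\pi = 1$ and $\gamma(x) = x$; the helper names are hypothetical.

```python
import numpy as np

x = np.linspace(0.0, 1.0, 100001)
dx = x[1] - x[0]
weights = np.full(x.size, dx)
weights[0] = weights[-1] = 0.5 * dx          # trapezoidal quadrature weights
ref = np.ones_like(x)                        # hypothetical reference density
pi = 1.0                                     # constant momentum field (assumption)

def log_Z(mu):
    """log Z_pi(rho, mu) from equation (60) with gamma(x) = x."""
    return np.log(np.sum(weights * ref * np.exp(pi - mu * x)))

# Abbreviations (65)-(67)
n1 = np.sum(weights * ref * np.exp(pi))
u1 = np.sum(weights * x * ref * np.exp(pi)) / n1
v1 = np.sum(weights * x ** 2 * ref * np.exp(pi)) / n1 - u1 ** 2

def log_Z_quadratic(mu):
    """Right hand side of the expansion (68)."""
    return np.log(n1) - mu * u1 + 0.5 * mu ** 2 * v1

def mu_estimate(u_target):
    """Approximate Lagrange parameter from equation (69)."""
    return -(u_target - u1) / v1

error_small = abs(log_Z(0.1) - log_Z_quadratic(0.1))
error_large = abs(log_Z(3.0) - log_Z_quadratic(3.0))
```

In this toy case the expansion is very accurate for small $|\mu|$, i.e. for target values close to $u'_j$, and visibly worse for $\mu = 3$, in line with the remark that the approximation makes sense only as long as $u_{j+1}$ stays close to $u'_j$.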



At this point it is useful to look back to equation (57) and to note that there
are two ways in which $\rho_{j+1}$ deviates from $\mathcal{F}[\rho_j]$: If the exponential factor on
the right hand side of the equation did not contain an $x$-dependent part $\pi_{j+1}(x)$,
$\rho_{j+1}$ would be, among all densities with the prescribed value $u_{j+1}$ for the
global variable $G$, the one which has the smallest relative entropy with respect
to $\mathcal{F}[\rho_j]$. The additional effect of $\pi_{j+1}(x)$ is to allow for the fact that for
minimising the total n-step action it is usually advantageous not to try to
minimise the one-step actions $H(\rho_{j+1} \mid \mathcal{F}[\rho_j])$ in all the single steps but to look
at the overall effect of the choice of $\rho_{j+1}$ on the total n-step action.

However, it is the non-trivial influence of $\pi_{j+1}(x)$ that makes it so difficult
to find a minimising sequence which fulfils equations (57) and (58). The
situation becomes much simpler if we neglect the influence of $\pi_{j+1}(x)$ by just
concentrating on eq. (57) and setting

$$\pi_j(x) = 1 \quad \text{for all } j \tag{72}$$

in this equation. This is expected to lead to a reasonable approximation of the
contracted quasipotential in the vicinity of the attractor in the dynamics of
the global variable $G$, since in this case a minimising sequence of densities $\rho_j$
should stay close to the attractor so that there is not much scope for reducing
the action by changing the sequence through the influence of $\pi_j$. In any case,
the approximate contracted quasipotential computed with the assumption (72)
is an upper bound for the true contracted quasipotential.

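With (72) we obtain from equations (65)-(67):

$$n'_j = e, \tag{73}$$

$$\tilde{u}_j := u'_j = \int_M \gamma(x) \mathcal{F}[\rho_j](x) \, dx, \tag{74}$$

$$\tilde{v}_j := v'_j = \int_M (\gamma(x))^2 \mathcal{F}[\rho_j](x) \, dx - (u'_j)^2. \tag{75}$$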

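From (71) we see that the one-step action along such a minimising sequence
can be approximated as

$$H(\rho_{j+1} \mid \mathcal{F}[\rho_j]) \approx \frac{(u_{j+1} - \tilde{u}_j)^2}{2 \tilde{v}_j}. \tag{76}$$

Using this approximation for calculating the constrained n-action (47), we
can go back to equation (48) and obtain

$$\phi_G(u) \approx \inf_{n \geq 1} \ \inf_{u_1, \ldots, u_{n-1}} \sum_{j=0}^{n-1} \frac{(u_{j+1} - \tilde{u}_j)^2}{2 \tilde{v}_j}, \qquad u_n = u. \tag{77}$$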

We have written this equation in a form which resembles the well studied
situation of a quasipotential for the weak noise asymptotics of a low dimensional
system with white Gaussian noise ($x_{t+1} = F(x_t) + \xi_t$) [53,52]. In that case, a
quasipotential $\Phi(x)$ which describes the asymptotic behaviour of the system's
invariant density can be computed as follows:

$$\Phi(x) = \inf_{n \geq 1} \ \inf_{q_1, \ldots, q_{n-1}} \sum_{j=0}^{n-1} \frac{(q_{j+1} - F(q_j))^2}{2}, \tag{78}$$

where in the sum on the right hand side $q_0$ is a point of the attractor of $F$,
and $q_n = x$.
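
For a one-dimensional map, the double infimum in (78) can be approximated by dynamic programming on a grid. The sketch below is an illustration under stated assumptions, not one of the algorithms of [53,52]: the grid, the truncation `n_max` of the infimum over $n$, and the function name are choices made here, and the linear test map $F(x) = bx$ is used because its quasipotential $(1 - b^2) x^2 / 2$ is known in closed form.

```python
import numpy as np

def quasipotential_dp(F, grid, start_index, n_max=60):
    """Approximate (78) on a grid: minimise the summed one-step costs
    (q_{j+1} - F(q_j))^2 / 2 over grid paths that start at grid[start_index]."""
    # cost[i, k]: cost of the step from grid[i] to grid[k]
    cost = 0.5 * (grid[None, :] - F(grid)[:, None]) ** 2
    action = np.full(grid.size, np.inf)
    action[start_index] = 0.0
    best = np.full(grid.size, np.inf)
    for _ in range(n_max):
        # Least (n+1)-step action from the least n-step action (min-plus product).
        action = np.min(action[:, None] + cost, axis=0)
        best = np.minimum(best, action)
    return best

b = 0.5
grid = np.linspace(-1.0, 1.0, 201)
phi = quasipotential_dp(lambda x: b * x, grid, start_index=100)   # grid[100] is the fixed point 0
exact = 0.5 * (1.0 - b ** 2) * grid ** 2
```

Because grid paths form a subset of all admissible paths, `phi` lies slightly above `exact`; the difference shrinks as the grid is refined.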

Despite this similarity, there is a crucial difference between equation (77) and
equation (78): The points $\tilde{u}_j$ which appear in equation (77) are not simply
images of the points $u_j$ under some low dimensional map $F$. Instead, in order
to calculate $\tilde{u}_j$ one has to know the full density $\rho_j$. Therefore it is not enough
to look only at the sequence of points $(u_j)$ but one has to follow the sequence
of densities $(\rho_j)$, which, however, is determined by equation (57) with (72).
Unfortunately this means that in order to evaluate (77) one cannot simply
use the known algorithms for computing quasipotentials according to (78).
Nevertheless, solving equation (77) is much easier than solving the original
problem (46), since the minimisation now requires only variation of a sequence
of points instead of variation of a sequence of densities.

With equation (77) we have found a generally applicable procedure for
computing an upper bound for contracted quasipotentials which can serve as a
good estimate for the exact contracted quasipotential near attractors.



4.2 A simple example

In this subsection we study a very simple model system of globally coupled 
noisy maps. The purpose of this example is, on the one hand, to illustrate the 
definitions and computational procedures that were introduced in this article, 
and, on the other hand, to serve as a test for the quality of the approximation 
formula (77). 

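The model system consists simply of linear maps on $\mathbb{R}$:

$$f_a(x) = ax, \qquad -1 < a < 1, \tag{79}$$

to which white Gaussian noise with a standard deviation $\sigma$ is added, and which
are coupled through a function $k_\kappa : \mathbb{R} \to \mathbb{R}$ of their mean field (where $\kappa$ is a
coupling strength and $k_0(h) = 0$ for all $h \in \mathbb{R}$); this means that equation (8)
has to be applied with the function

$$F_\kappa(x, \mu) = f_a(x) + k_\kappa(h(\mu)) = ax + k_\kappa(h(\mu)) \tag{80}$$

(cf. (14)), and equation (23) becomes:

$$\mathcal{F}[\rho](y) = \frac{1}{\sqrt{2\pi} \, \sigma} \int_{\mathbb{R}} \exp\Big[ -\frac{(y - ax - k_\kappa(h[\rho]))^2}{2\sigma^2} \Big] \rho(x) \, dx. \tag{81}$$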



The subsystem state space of this example is $\mathbb{R}$ rather than a compact space,
in contrast to an assumption that was used for the derivations in the previous
sections; the reason for choosing such a system is the simplicity of calculations
which serves the purpose of illustrating the various steps in the construction
of quasipotentials. Therefore we do not bother here to give proofs for the
non-compact case. In practice it is not difficult (for sufficiently small coupling
strength) to modify the system in a way that the subsystems are restricted
to a closed interval by cutting off the noise whenever a subsystem would leave
that interval. Such a modification does not change the asymptotic behaviour
and therefore it is convenient to use the original system for calculations.

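We denote a Gaussian probability density with mean value $m$ and variance $v$
by

$$\mathcal{N}_{m,v}(y) := \frac{1}{\sqrt{2\pi v}} \, e^{-\frac{(y - m)^2}{2v}}. \tag{82}$$

Now it is an important observation that Gaussian densities stay Gaussian
under the influence of the temporal evolution (81). In fact, we have

$$\mathcal{F}[\mathcal{N}_{m,v}] = \mathcal{N}_{\tilde{m},\tilde{v}} \tag{83}$$

with

$$\tilde{m} = am + k_\kappa(am), \tag{84}$$

$$\tilde{v} = a^2 v + \sigma^2. \tag{85}$$

Equation (85) has a stable fixed point

$$v^* = \frac{\sigma^2}{1 - a^2}. \tag{86}$$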



If $m^*$ is any point of an attractor of (84), the density $\mathcal{N}_{m^*,v^*}$ is a point of an
attractor of $\mathcal{F}$. For a linear coupling, $k_\kappa(h) = \kappa h$, one has $m^* = 0$, but for
nonlinear couplings there may be different attractors of (84).
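
The moment maps (84) and (85) are simple enough to iterate directly. The following sketch assumes the linear coupling $k_\kappa(h) = \kappa h$ and hypothetical parameter and starting values; it only illustrates the convergence to the fixed point (86) and to $m^* = 0$.

```python
def iterate_moments(m, v, a, kappa, sigma, steps):
    """Iterate (84) and (85) for the linear coupling k_kappa(h) = kappa * h."""
    for _ in range(steps):
        m = a * m + kappa * (a * m)      # equation (84)
        v = a ** 2 * v + sigma ** 2      # equation (85)
    return m, v

a, kappa, sigma = 0.5, 0.4, 1.0          # hypothetical parameter values
m_final, v_final = iterate_moments(m=2.0, v=5.0, a=a, kappa=kappa, sigma=sigma, steps=200)
v_star = sigma ** 2 / (1.0 - a ** 2)     # fixed point (86)
```

The mean contracts by the factor $(1 + \kappa) a$ per step and the variance error by the factor $a^2$, so both converge geometrically whenever $|(1 + \kappa) a| < 1$.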

We now want to find the contracted quasipotential $\phi_m$ for the global variable
$m$, the mean value, i.e. $\gamma(x) = x$ in equation (53). Starting from the density
$\rho_0 = \mathcal{N}_{m^*,v^*}$, one has to look for a minimising sequence of densities $\rho_j$ according
to equation (57) and fulfilling the constraints (52) for a given sequence of mean
values $(u_j)$.

If we use the simplification (72), the minimising sequence turns out to be the
sequence of Gaussian densities $\mathcal{N}_{u_j,v^*}$. Then, the approximation (77) for the
contracted quasipotential reads:

$$\phi_m(m) \approx \inf_{n \geq 1} \ \inf_{u_1, \ldots, u_{n-1}} \sum_{j=0}^{n-1} \frac{(u_{j+1} - \tilde{u}_j)^2}{2 v^*}, \tag{87}$$

with $u_0 = m^*$, $u_n = m$, where the connection between $\tilde{u}_j$ and $u_j$ is as in
equation (84).

This means that in this case the approximate contracted quasipotential is,
up to the constant factor $1/v^*$, equal to the quasipotential of the
one-dimensional map $m \mapsto am + k_\kappa(am)$ with Gaussian random perturbations,
which can be calculated using the well established methods of [53] and [52].

In the case of linear coupling, $k_\kappa(h) = \kappa h$, this map is linear: $m \mapsto (1 + \kappa) a m$,
and equation (87) can be solved explicitly:

$$\phi_m(m) \approx \frac{1 - (1 + \kappa)^2 a^2}{2 v^*} \, m^2. \tag{88}$$

This result can be checked through numerical simulations of the system of
coupled maps (8) with (80): The data of a long time simulation of the $N$
subsystems can be used for plotting a histogram $W_m^{[N]}(m)$ of the distribution
of mean values, and Theorem 4 tells us that $-\frac{1}{N} \log W_m^{[N]}(m)$ should converge
to the contracted quasipotential as $N \to \infty$. Typically, one expects that
already for large finite $N$ the histogram $-\frac{1}{N} \log W_m^{[N]}(m)$ shows the shape of the
quasipotential, shifted by an additive offset due to the pre-factor hidden in
asymptotic relations like (27), which is only weakly dependent on $m$.

Fig. 3 compares the approximation (88) for the contracted quasipotential with
histograms obtained in simulations. Part a) shows the case $\kappa = 0$, $a = 0$ of
uncoupled identically distributed random variables, where the result $\phi_m(m) =
\frac{1}{2\sigma^2} m^2$ is not only a special case of our approximation (88) but a necessary
consequence of the central limit theorem. The histogram contains the results
of simulations with N = 10^ and $\sigma = 1$, and its shape fits very well with the
shifted contracted quasipotential, which is the solid line parabola in the figure.
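
The histogram test can be sketched for the uncoupled case of part a). The code below makes several assumptions that are not taken from the simulations reported here: for $a = 0$, $\kappa = 0$ each subsystem state is taken to be a fresh Gaussian variable with standard deviation $\sigma$ in every time step, and the values of `N`, the number of samples, the bins and the random seed are arbitrary. The estimate is shifted so that its minimum is zero, which removes the additive offset caused by the pre-factor.

```python
import numpy as np

def empirical_rate(means, N, edges):
    """Estimate -(1/N) log of the histogram density of the mean values,
    shifted so that the smallest value is zero."""
    counts, edges = np.histogram(means, bins=edges)
    centres = 0.5 * (edges[:-1] + edges[1:])
    keep = counts > 0                     # empty bins carry no estimate
    density = counts[keep] / (means.size * np.diff(edges)[keep])
    rate = -np.log(density) / N
    return centres[keep], rate - rate.min()

rng = np.random.default_rng(0)
N, sigma, samples = 100, 1.0, 50000       # arbitrary illustrative values
# Uncoupled case: the mean value of N independent Gaussian subsystem states.
means = rng.normal(0.0, sigma, size=(samples, N)).mean(axis=1)
centres, rate = empirical_rate(means, N, np.linspace(-0.25, 0.25, 26))
prediction = centres ** 2 / (2.0 * sigma ** 2)
```

Plotting `rate` against `centres` together with `prediction` gives the kind of comparison described for Fig. 3 a); the statistical scatter grows towards the tails, where only few samples fall into a bin.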

Part b) is for the less trivial case $\kappa = 0.4$, $a = 0.5$. Here and in all the following
examples we again use $\sigma = 1$. The lower half of the figure shows a histogram
for a simulation with N = 10^, the upper half with N = 10^. While the
parabolic contracted quasipotential (88) gives a reasonable approximation to
the histogram for N = 10^, it obviously works better for N = 10®.

Looking at nonlinear coupling functions $k_\kappa(h)$ one can study more complicated
mean field dynamics. With the choice

$$k_\kappa(h) = 10 \kappa \, (1 - h - \alpha h^2) \tag{89}$$

with some nonlinearity parameter $\alpha$ one has for $\kappa = 0.1$ the following mean
field map: $m \mapsto 1 - \alpha a^2 m^2$. This is the logistic map, and so the temporal
evolution of the mean field can show all the dynamical behaviours that are known
for the logistic map. The approximation (87) for the contracted quasipotential
is the conventional quasipotential for the logistic map, which cannot be
written down explicitly but can be computed numerically with high precision.

Fig. 4 shows the results for this nonlinearly coupled system of noisy linear
maps with $a = 0.5$, $\kappa = 0.1$ and $\alpha = 3.16$. For this set of parameters the mean
field map has a stable period-2 orbit. Part a) of the figure shows the approximate
contracted quasipotential $\phi_m(m)$ with its two minima. Part b) contains two
histograms for $-\frac{1}{N} \log W_m^{[N]}(m)$, the lower one with N = 10^ and the upper
one with N = 10^. Parts c) and d) show enlargements of these histograms in
the neighbourhood of the left and right minimum, respectively, together with
shifted portions of the contracted quasipotential (as solid lines) in order to
demonstrate that again the approximate contracted quasipotentials correctly
predict the shape of the histograms within the numerical accuracy.
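
The reduction of the mean field map to the logistic form, and the period-2 behaviour for the parameters of Fig. 4, can be checked with a few lines of code. The sketch below only iterates the deterministic map (84) with the coupling (89); the initial value, the number of iterations and the function names are arbitrary choices made here.

```python
def coupling(h, kappa, alpha):
    """Nonlinear coupling function (89)."""
    return 10.0 * kappa * (1.0 - h - alpha * h ** 2)

def mean_field_map(m, a, kappa, alpha):
    """Equation (84) with the coupling (89)."""
    return a * m + coupling(a * m, kappa, alpha)

a, kappa, alpha = 0.5, 0.1, 3.16
m = 0.3                                   # arbitrary initial mean value
for _ in range(2000):                     # discard the transient
    m = mean_field_map(m, a, kappa, alpha)
orbit = [m]
for _ in range(3):
    orbit.append(mean_field_map(orbit[-1], a, kappa, alpha))
```

After the transient, `orbit` alternates between two values, which are the two minima of the approximate contracted quasipotential in this parameter regime.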

As a final example we look at the system with $a = 0.5$, $\kappa = 0.1$ and $\alpha = 7.92$.
In this case the mean field map is chaotic on an interval. We use this case
as an example of how to apply time delay contracted quasipotentials (45).
Fig. 5 a) shows the result of a simulation of this system with $N = 3000$ in
the form of a time delay plot. The time delay contracted quasipotential in
Fig. 5 b), which again has been obtained with the approximation (72), allows
predictions about the blurring effect of finite $N$ in numerical simulations of
such systems.



5 Conclusion and outlook 



The aim of this article was to develop a method for describing large deviations
from the thermodynamic limit of systems of globally coupled noisy maps, modelled
on the quasipotential method for stochastically perturbed dynamical systems.

On an abstract level this has been achieved by interpreting the time evolution 
in the thermodynamic limit as a consequence of a deterministic dynamical 
system on a space of probability measures (Theorem 1) and by recognising 
that the deviations from this thermodynamic limit behaviour in one time step 
follow a statistical distribution which allows exponential asymptotic estimates 
for large deviations (Theorem 2). These are the two ingredients necessary for 
constructing abstract quasipotentials on the space of probability measures 
(Theorem 3). 

These abstract quasipotentials contain the full information about large deviations
from the thermodynamic limit in the long term. However, being defined
on an infinite dimensional space of probability measures, they are rather
inaccessible through concrete calculations and ill-suited for directly interpreting
the observed long-term distribution of macroscopic quantities in large finite
systems.

Therefore, the concept of contracted quasipotentials has been introduced (The- 
orem 4 and equation (45)) in order to extract from the abstract quasipotential 
information about concrete and observable characteristics of the temporal evo- 
lution of macroscopic quantities. 

The Hamiltonian approach discussed in Section 4 makes it clear how difficult
it is to solve the variational problem behind the quasipotentials. On the
other hand it opens the way for approximations which turn the problem into
a manageable one. In particular, the approximation (77) is appropriate in the
vicinity of attractors of the macroscopic dynamics.

The simple example with linearly and nonlinearly coupled linear maps shows
that this approximation gives a correct description of the probability of rare
deviations from the thermodynamic limit close to an attractor. An investigation
of the much more interesting situation of coupled nonlinear maps like the
tent map or the logistic map, which requires the implementation of a new
numerical scheme, is in progress and is expected to lead to a correct description
of the blurring effect of finite system size on the attractors of the macroscopic
dynamics in such systems. In addition, this approach will offer a new answer
to the question under which circumstances the invariant density, which is a
fixed point of the nonlinear Frobenius-Perron equation, can lose stability. It
will be interesting to compare this new criterion to the various statements
obtained by linear stability analysis of the Frobenius-Perron dynamics, which
depend on which norm is used for measuring the smallness of perturbations
[25-27,29,30,22,31,32].

However, as the approximation (77) is limited to the vicinity of attractors,
it is desirable to go beyond the approximation (72) of equation (58) so that
it is possible to calculate global features of quasipotentials like quasipotential
barriers between different attractors. This is probably not possible for the
general case, but for special choices of $\mathcal{F}$ there is some hope of being able to
simplify equation (58), for instance if the maps $F_\kappa(x, \mu)$ are piecewise linear
in $x$ and the noise influence in $\mathcal{F}$ is weak (i.e., if $Q_x$ is close to $\delta_x$).



In view of this problematic situation with the evaluation of quasipotentials
it is consoling that there is a class of systems for which the computation of
quasipotentials for finite-$N$ effects in globally coupled systems is not more
difficult than the computation of quasipotentials for weak noise effects in
low-dimensional dynamical systems. This class of systems consists of coupled
subsystems which are defined on a finite state space $M$. More concretely, if $M$
consists of $m$ elements, empirical measures can be written as $(m - 1)$-dimensional
vectors, so that $\mathcal{F}$ defines an $(m - 1)$-dimensional dynamical system, and the
quasipotential $\Phi$ is defined on an $(m - 1)$-dimensional space. While such
systems with a finite discrete subsystem space are not directly motivated by
realistic physical systems, they can be studied as conceptually simplest
prototypes for globally coupled noisy systems that show non-trivial effects such as
noise induced phase transitions when $N$ is finite, which can be analysed using
quasipotentials. Results in this direction will be published elsewhere [54].

A Appendix



This Appendix gives details of a proof of Theorem 2. The reader who is not
familiar with the mathematics literature on large deviations will find this
theorem less intuitive than most other statements in this article. Therefore,
instead of referring to related results in [42], we try to use a fairly
self-contained line of argument here, following the proof of Sanov's theorem
in [47], but without making explicit use of stochastic control theory.

We try to use a language which keeps close to the standard conventions of
probability theory [43,44], but at the same time makes the formulae readable
even without a full technical understanding of mathematical details. In
particular, all probabilities about which we talk are derived from a
probability measure $\mathrm{Prob} : \mathcal{A} \to \mathbb{R}^+_0$ on a
measure space $(\Omega, \mathcal{A})$, where $\mathcal{A}$ is a sigma-algebra
of events, which are subsets of $\Omega$. A random variable $R$ with state
space $(M, \mathcal{B}_M)$ (where $\mathcal{B}_M$ is a sigma-algebra of
(Borel) subsets of $M$) is a measurable function $R : \Omega \to M$, and
$\mathrm{Prob}\{R \in A\}$ for $A \subset M$ is a shorthand notation for
$\mathrm{Prob}(\{\omega \in \Omega : R(\omega) \in A\})$. We write
$\mathrm{Prob}\{R \in \cdot\}$ for the probability measure on $M$ induced by
$R$, and integrals over functions $f : M \to \mathbb{R}$ with respect to this
measure as $\int_M f(m)\,\mathrm{Prob}\{R \in dm\}$. If $R$ and $S$ are two
random variables on state spaces $M_R$ and $M_S$, then we write the
conditional probability that $R$ is in $A$ conditioned on $S = s$,
$s \in M_S$, as $\mathrm{Prob}\{R \in A \mid S = s\}$. This object can be
interpreted as a measurable function
$s \mapsto \mathrm{Prob}\{R \in A \mid S = s\}$ with the property

$$ \int_B \mathrm{Prob}\{R \in A \mid S = s\}\,\mathrm{Prob}\{S \in ds\} = \mathrm{Prob}\{R \in A \text{ and } S \in B\} \tag{A.1} $$



(which is not unique since there are several functions with this property, but
they differ only on sets of $\mathrm{Prob}\{S \in \cdot\}$-measure zero). The
topological assumptions of this paper guarantee that for every fixed
$s \in M_S$, $\mathrm{Prob}\{R \in \cdot \mid S = s\}$ is a probability
measure.

Before starting with the proof it is useful to see an important property of the 
relative entropy: 

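Let $g : M \to \mathbb{R}$ be a bounded measurable function and
$\theta \in \mathcal{P}(M)$. Then the following variational formula holds
true:

$$ -\log \int_M e^{-g(m)}\,\theta(dm) = \inf_{\chi \in \mathcal{P}(M)} \left\{ H(\chi|\theta) + \int_M g(m)\,\chi(dm) \right\}. \tag{A.2} $$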



In order to see that this is true, it is sufficient to check the right hand
side of (A.2) for measures $\chi$ which are absolutely continuous with respect
to $\theta$, since otherwise $H(\chi|\theta) = \infty$.

A density

$$ \frac{d\chi_0}{d\theta}(m) := \frac{e^{-g(m)}}{\int_M e^{-g(m')}\,\theta(dm')} \tag{A.3} $$

defines a measure $\chi_0 \in \mathcal{P}(M)$, and obviously a measure $\chi$
which is absolutely continuous with respect to $\theta$ is absolutely
continuous with respect to $\chi_0$, too. Therefore we can write (cf. eq.
(29)):

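$$ H(\chi|\theta) = \int_M \left(\log\frac{d\chi}{d\theta}\right) d\chi = \int_M \left(\log\frac{d\chi}{d\chi_0}\right) d\chi + \int_M \left(\log\frac{d\chi_0}{d\theta}\right) d\chi, $$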



and because of (A.3) this means:

$$ H(\chi|\theta) + \int_M g(m)\,\chi(dm) = H(\chi|\chi_0) - \log \int_M e^{-g(m)}\,\theta(dm). \tag{A.4} $$



Since $H(\chi|\chi_0) \ge 0$, with equality exactly for the case
$\chi = \chi_0$, the variational formula (A.2) is true.
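The variational formula (A.2) can be checked numerically on a finite state space, where measures are probability vectors and the relative entropy is a finite sum. The following is a minimal sketch under that assumption; the vectors `theta` and `g` and all function names are illustrative choices, not taken from the article. It evaluates the left hand side, the right hand side at the minimiser from (A.3), and the right hand side at randomly drawn measures.

```python
import numpy as np

def relative_entropy(chi, theta):
    """H(chi|theta) = sum_i chi_i log(chi_i / theta_i), with 0 log 0 = 0."""
    mask = chi > 0
    return float(np.sum(chi[mask] * np.log(chi[mask] / theta[mask])))

def variational_check(theta, g, n_trials=1000, seed=0):
    """Compare both sides of (A.2) for a finite state space."""
    rng = np.random.default_rng(seed)
    # Left hand side: -log of the theta-average of exp(-g).
    lhs = float(-np.log(np.sum(theta * np.exp(-g))))
    # Minimiser chi_0 with density exp(-g) / normalisation, as in (A.3).
    chi0 = theta * np.exp(-g)
    chi0 /= chi0.sum()
    rhs_at_chi0 = relative_entropy(chi0, theta) + float(np.dot(g, chi0))
    # Random probability vectors can never go below the left hand side.
    trials = rng.dirichlet(np.ones(len(theta)), size=n_trials)
    rhs_min_random = min(
        relative_entropy(c, theta) + float(np.dot(g, c)) for c in trials
    )
    return lhs, rhs_at_chi0, rhs_min_random

theta = np.array([0.2, 0.5, 0.3])   # illustrative reference measure
g = np.array([1.0, -0.5, 2.0])      # illustrative bounded function
lhs, rhs_at_chi0, rhs_min_random = variational_check(theta, g)
```

In this sketch `rhs_at_chi0` agrees with `lhs` up to rounding, while `rhs_min_random` stays above it, which is the content of (A.4).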

Now we turn to the proof of Theorem 2. We introduce the following abbrevi- 
ation: 

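$$ V_N := -\frac{1}{N} \log \int_{\mathcal{P}(M)} e^{-N h(\varphi)}\,\mathrm{Prob}\{\mathcal{E}^{[N]}_{t+1} \in d\varphi \mid \mathcal{E}^{[N]}_{t} = \nu^{[N]}\}. \tag{A.5} $$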



With this notation, Theorem 2 states (see eq. (26)): 

$$ \lim_{N \to \infty} V_N = \inf_{\varphi \in \mathcal{P}(M)} \left[ H(\varphi|\mathcal{T}(\nu)) + h(\varphi) \right]. \tag{A.6} $$



In order to show this statement, we will first derive a different
representation of $V_N$. The construction of the new representation starts
with building up the empirical measure $\mathcal{E}^{[N]}_{t+1}$ step by step:
If we define

$$ \mathcal{E}^{[0]}_{t+1} := 0, \tag{A.7} $$

then the following iteration:

$$ \mathcal{E}^{[j+1]}_{t+1} := \mathcal{E}^{[j]}_{t+1} + \frac{1}{N}\,\delta_{x^{(j+1)}_{t+1}} \tag{A.8} $$

for $j = 0, \dots, N-1$ leads to $\mathcal{E}^{[N]}_{t+1}$. Note that
$\mathcal{E}^{[j]}_{t+1}$ is an element of the space of measures $\psi$ on $M$
which have the property $\psi(M) = \frac{j}{N}$, and this space we denote by
$\mathcal{P}_{j/N}(M)$.
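The stepwise construction of the empirical measure can be made concrete for a finite state space. The sketch below assumes that the states are hashable labels and uses exact fractions so that the total mass $j/N$ after $j$ steps can be read off directly; the function name and the sample data are hypothetical.

```python
from fractions import Fraction

def build_empirical_measure(samples, states):
    """Add point masses of weight 1/N one sample at a time."""
    n = len(samples)
    # Start from the zero measure (mass 0 on every state).
    measure = {s: Fraction(0) for s in states}
    partial_masses = [sum(measure.values())]
    for x in samples:
        # One iteration step: add a point mass of weight 1/N at x.
        measure[x] += Fraction(1, n)
        partial_masses.append(sum(measure.values()))
    return measure, partial_masses

samples = ["a", "b", "a", "c"]      # illustrative subsystem states
measure, masses = build_empirical_measure(samples, states=["a", "b", "c"])
```

After `j` steps the partial measure has total mass `j/N`, and only the final measure is a probability measure.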

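Now we want to define an iterative scheme that leads to $V_N$. The
intermediate steps involve the following quantities (with $j = 0, \dots, N$
and $\psi \in \mathcal{P}_{j/N}(M)$):

$$ V_N^{[j|\psi]} := -\frac{1}{N} \log \int_{\mathcal{P}(M)} e^{-N h(\varphi)}\,\mathrm{Prob}\{\mathcal{E}^{[N]}_{t+1} \in d\varphi \mid \mathcal{E}^{[j]}_{t+1} = \psi \text{ and } \mathcal{E}^{[N]}_{t} = \nu^{[N]}\}. \tag{A.9} $$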

The two extreme cases of this definition are

$$ V_N^{[0|0]} = V_N \tag{A.10} $$

and

$$ V_N^{[N|\varphi]} = h(\varphi) \tag{A.11} $$

for $\varphi \in \mathcal{P}(M)$.

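In search for an iterative connection between these two cases we write down
the following sequence of equations:

$$ \begin{aligned}
e^{-N V_N^{[j|\psi]}}
&= \int_{\mathcal{P}(M)} e^{-N h(\varphi)}\,\mathrm{Prob}\{\mathcal{E}^{[N]}_{t+1} \in d\varphi \mid \mathcal{E}^{[j]}_{t+1} = \psi \text{ and } \mathcal{E}^{[N]}_{t} = \nu^{[N]}\} \\
&= \int_{\mathcal{P}_{(j+1)/N}(M)} \left[ \int_{\mathcal{P}(M)} e^{-N h(\varphi)}\,\mathrm{Prob}\{\mathcal{E}^{[N]}_{t+1} \in d\varphi \mid \mathcal{E}^{[j+1]}_{t+1} = \psi' \text{ and } \mathcal{E}^{[N]}_{t} = \nu^{[N]}\} \right] \\
&\qquad\qquad \times \mathrm{Prob}\{\mathcal{E}^{[j+1]}_{t+1} \in d\psi' \mid \mathcal{E}^{[j]}_{t+1} = \psi \text{ and } \mathcal{E}^{[N]}_{t} = \nu^{[N]}\} \\
&= \int_{\mathcal{P}_{(j+1)/N}(M)} e^{-N V_N^{[j+1|\psi']}}\,\mathrm{Prob}\{\mathcal{E}^{[j+1]}_{t+1} \in d\psi' \mid \mathcal{E}^{[j]}_{t+1} = \psi \text{ and } \mathcal{E}^{[N]}_{t} = \nu^{[N]}\} \\
&= \int_M e^{-N V_N^{[j+1|\psi + \frac{1}{N}\delta_x]}}\,\bigl(\mathcal{T}(\nu^{[N]})\bigr)(dx),
\end{aligned} $$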

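where in the last step we have used eq. (24). If we take the logarithm of this
equation and apply the variational formula (A.2) with the settings
$\theta = \mathcal{T}(\nu^{[N]})$ and
$g(m) = N V_N^{[j+1|\psi + \frac{1}{N}\delta_m]}$, then we obtain

$$ V_N^{[j|\psi]} = \inf_{\varphi \in \mathcal{P}(M)} \left\{ \frac{1}{N} H(\varphi|\mathcal{T}(\nu^{[N]})) + \int_M V_N^{[j+1|\psi + \frac{1}{N}\delta_m]}\,\varphi(dm) \right\}. \tag{A.12} $$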



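In the light of [49] this can be called a stochastic possibilistic dynamics.
If we start from $V_N^{[N|\psi]} = h(\psi)$ and iterate eq. (A.12) $N$ times
we arrive at the following equation:

$$ V_N = \inf_{\varphi_1, \dots, \varphi_N \in \mathcal{P}(M)} \left\{ \frac{1}{N} \sum_{j=1}^{N} H(\varphi_j|\mathcal{T}(\nu^{[N]})) + \int_M \cdots \int_M h(\mathcal{E}^{[N]}_{t+1}) \prod_{j=1}^{N} \varphi_j(dx^{(j)}_{t+1}) \right\}. \tag{A.13} $$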



This representation of $V_N$ is very convenient for showing half of the
statement (A.6) of Theorem 2, namely

$$ \limsup_{N \to \infty} V_N \le \inf_{\varphi \in \mathcal{P}(M)} \left[ H(\varphi|\mathcal{T}(\nu)) + h(\varphi) \right]. \tag{A.14} $$



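This inequality follows from eq. (A.13) in the following way: restricting the
infimum in (A.13) to the case $\varphi_1 = \dots = \varphi_N = \varphi$ can
only increase its value, so that

$$ V_N \le \inf_{\varphi \in \mathcal{P}(M)} \left\{ H(\varphi|\mathcal{T}(\nu^{[N]})) + \int_M \cdots \int_M h(\mathcal{E}^{[N]}_{t+1}) \prod_{j=1}^{N} \varphi(dx^{(j)}_{t+1}) \right\}. \tag{A.15} $$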



Using the facts that $\nu^{[N]}$ converges weakly to $\nu$ and
$\mathcal{E}^{[N]}_{t+1}$ converges weakly to $\varphi$ with probability 1,
this inequality leads directly to (A.14).
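The backward iteration from (A.11) down to (A.10) can be illustrated for the simplest case of a two-element state space $M = \{0, 1\}$. The sketch below assumes that $\mathcal{T}(\nu^{[N]})$ gives probability `p` to state 0 and that `h` depends on a measure only through its weight on state 0; both are illustrative assumptions, and all names are hypothetical. Each step uses the logarithmic form of the one-step equation, which by the variational formula (A.2) has the same value as the infimum in (A.12).

```python
import math

def direct_value(n, p, h):
    """V_N from its definition: the empirical weight k/N of state 0 is binomial."""
    total = sum(
        math.comb(n, k) * p**k * (1 - p) ** (n - k) * math.exp(-n * h(k / n))
        for k in range(n + 1)
    )
    return -math.log(total) / n

def recursive_value(n, p, h):
    """V_N from the backward iteration over partial empirical measures."""
    # v[k]: value for a partial measure in which k of the j points sit in state 0.
    v = [h(k / n) for k in range(n + 1)]      # j = N: terminal condition (A.11)
    for j in range(n - 1, -1, -1):
        # Adding one point moves k to k + 1 (state 0) or leaves it (state 1).
        v = [
            -math.log(
                p * math.exp(-n * v[k + 1]) + (1 - p) * math.exp(-n * v[k])
            ) / n
            for k in range(j + 1)
        ]
    return v[0]                               # j = 0: this is V_N, cf. (A.10)

def h(u):
    """Illustrative continuous function of the weight u on state 0."""
    return (u - 0.6) ** 2

v_direct = direct_value(30, 0.3, h)
v_recursive = recursive_value(30, 0.3, h)
```

Both routes give the same number up to rounding, which is the consistency between the definition (A.5) and the iterative scheme that the proof relies on.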

Now the only missing part in the proof of Theorem 2 is the second half of
(A.6), namely

$$ \liminf_{N \to \infty} V_N \ge \inf_{\varphi \in \mathcal{P}(M)} \left[ H(\varphi|\mathcal{T}(\nu)) + h(\varphi) \right]. \tag{A.16} $$



Unfortunately this requires more work than (A.14), and we have to use some 
non-trivial but standard results from probability theory which can be found 
in introductory texts like [43,44]. 

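First of all we observe that the convexity of $H$ allows us to use Jensen's
inequality in (A.13), so that

$$ V_N \ge \inf_{\varphi_1, \dots, \varphi_N \in \mathcal{P}(M)} \left\{ H\Bigl( \frac{1}{N} \sum_{j=1}^{N} \varphi_j \Bigm| \mathcal{T}(\nu^{[N]}) \Bigr) + \int_M \cdots \int_M h(\mathcal{E}^{[N]}_{t+1}) \prod_{j=1}^{N} \varphi_j(dx^{(j)}_{t+1}) \right\}. \tag{A.17} $$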
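The Jensen step uses only the convexity of the relative entropy in its first argument. A minimal numerical sketch on a three-element state space follows; the reference measure `theta` and the randomly drawn measures `phis` are illustrative assumptions.

```python
import numpy as np

def relative_entropy(phi, theta):
    """H(phi|theta) = sum_i phi_i log(phi_i / theta_i), with 0 log 0 = 0."""
    mask = phi > 0
    return float(np.sum(phi[mask] * np.log(phi[mask] / theta[mask])))

rng = np.random.default_rng(1)
theta = np.array([0.1, 0.4, 0.5])             # illustrative reference measure
phis = rng.dirichlet(np.ones(3), size=8)      # eight random measures phi_j
alpha = phis.mean(axis=0)                     # their average measure

mean_of_entropies = float(np.mean([relative_entropy(p, theta) for p in phis]))
entropy_of_mean = relative_entropy(alpha, theta)
```

Here `entropy_of_mean` does not exceed `mean_of_entropies`, which is the inequality used to pass from (A.13) to (A.17).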



Introducing the abbreviation

$$ \alpha_N := \frac{1}{N} \sum_{j=1}^{N} \varphi_j \tag{A.18} $$



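we can rewrite the content of (A.17):

For every $\epsilon > 0$ and every $N \in \mathbb{N}$ there is a sequence
$(\varphi_1, \dots, \varphi_N)$ of elements in $\mathcal{P}(M)$ such that

$$ V_N + \epsilon > \int_M \cdots \int_M \left[ H(\alpha_N|\mathcal{T}(\nu^{[N]})) + h(\mathcal{E}^{[N]}_{t+1}) \right] \prod_{j=1}^{N} \varphi_j(dx^{(j)}_{t+1}). \tag{A.19} $$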



We are going to establish a connection between the measures $\alpha_N$ in the
first term and the empirical measures in the second term of the above sum:

Let $Y^{(j)}$ be a random variable with values in $M$ and
$\mathrm{Prob}(Y^{(j)} \in A) = \varphi_j(A)$. If $g : M \to \mathbb{R}$ is
bounded and continuous, then we find that the sequence of differences

$$ \int_M g(x)\,\mathcal{E}^{[N]}_{t+1}(dx) - \int_M g(x)\,\alpha_N(dx) = \frac{1}{N} \sum_{j=1}^{N} \left[ g(Y^{(j)}) - \int_M g(x)\,\varphi_j(dx) \right] \tag{A.20} $$

converges to 0 with probability 1 as $N \to \infty$, due to the strong law of
large numbers for non-identically distributed orthogonal random variables
[44].

Now consider an arbitrary infinite subsequence $(\alpha_{N_i})$ of the
sequence $(\alpha_N)$. Since $\mathcal{P}(M)$ is compact, the subsequence has
a subsubsequence $(\alpha_{N_{i_k}})$ that converges to some
$\alpha_* \in \mathcal{P}(M)$. For notational simplicity we write
$\alpha_{N_{i_k}} =: \alpha_{(k)}$. The convergence of the difference (A.20)
then tells us that with probability 1

$$ \int_M g(x)\,\mathcal{E}^{[(k)]}_{t+1}(dx) \to \int_M g(x)\,\alpha_*(dx). \tag{A.21} $$



So far the almost sure convergence in (A.21) holds true only after fixing $g$.
In order to conclude from (A.21) that $\mathcal{E}^{[(k)]}_{t+1}$ converges
weakly to $\alpha_*$ with probability 1, it is sufficient to know that there
is a countable dense subset of the space of bounded, uniformly continuous
functions (with regard to an appropriate metric), but this follows from a
standard separability result, see for instance Corollary 11.2.5 in [43].

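Combining these convergence results with the continuity of $h$ and the lower
semicontinuity of $H$ we know now that almost surely

$$ \liminf_{k \to \infty} \left[ H(\alpha_{(k)}|\mathcal{T}(\nu^{[(k)]})) + h(\mathcal{E}^{[(k)]}_{t+1}) \right] \ge \left[ H(\alpha_*|\mathcal{T}(\nu)) + h(\alpha_*) \right]. \tag{A.22} $$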



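Making use of Fatou's lemma this means together with (A.19) that for all
$\epsilon > 0$:

$$ \liminf_{k \to \infty} V_{(k)} + \epsilon \ge \left[ H(\alpha_*|\mathcal{T}(\nu)) + h(\alpha_*) \right] \ge \inf_{\varphi \in \mathcal{P}(M)} \left[ H(\varphi|\mathcal{T}(\nu)) + h(\varphi) \right]. \tag{A.23} $$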



This is nearly the statement (A.16) except that it refers not to the whole
sequence $V_N$ but to a subsubsequence $V_{(k)}$. However, it has been shown
that every subsequence of $V_N$ has such a subsubsequence. Now assume that eq.
(A.16) were not true. Then for sufficiently small $\epsilon > 0$ there would
be an infinite sequence of $V_N$ smaller than
$\inf_{\varphi \in \mathcal{P}(M)} [H(\varphi|\mathcal{T}(\nu)) + h(\varphi)] - \epsilon$,
and this subsequence would not have a subsubsequence which fulfils eq. (A.23).
This contradiction shows that (A.16) must be true, and the proof of Theorem 2
is complete.
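The limit asserted by Theorem 2 can be observed numerically in the simplest setting. The sketch below assumes a two-element state space, a fixed measure $\mathcal{T}(\nu^{[N]}) = \mathcal{T}(\nu)$ that gives probability `p` to state 0, and a function `h` that depends only on the weight on state 0; these choices and all names are illustrative and not taken from the article. The sum defining $V_N$ is evaluated in log space to avoid underflow for large $N$, and the infimum is approximated on a grid.

```python
import math

def log_binom_pmf(n, k, p):
    """Logarithm of the binomial probability of k successes in n trials."""
    return (
        math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
        + k * math.log(p) + (n - k) * math.log(1 - p)
    )

def v_n(n, p, h):
    """V_N = -(1/N) log E[exp(-N h(K/N))], K binomial, via log-sum-exp."""
    terms = [log_binom_pmf(n, k, p) - n * h(k / n) for k in range(n + 1)]
    top = max(terms)
    return -(top + math.log(sum(math.exp(t - top) for t in terms))) / n

def rate(p, h, grid=10001):
    """Grid approximation of inf over u of [H(u|p) + h(u)] on two states."""
    def entropy(u):
        out = 0.0
        if u > 0:
            out += u * math.log(u / p)
        if u < 1:
            out += (1 - u) * math.log((1 - u) / (1 - p))
        return out
    return min(
        entropy(i / (grid - 1)) + h(i / (grid - 1)) for i in range(grid)
    )

def h(u):
    """Illustrative continuous function of the weight u on state 0."""
    return (u - 0.6) ** 2

p = 0.3
limit = rate(p, h)
values = {n: v_n(n, p, h) for n in (10, 100, 1000, 10000)}
```

In this example the entries of `values` approach `limit` as `n` grows, in line with (A.6).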
Figure captions

Fig. 1: Time delay plot of the mean field $h_{t+1} = h[\rho_{t+1}]$ versus
$h_t = h[\rho_t]$ as obtained by numerically solving the nonlinear
Frobenius-Perron equation (4) for coupled tent maps (2) with $a = 1.65$ and
$\kappa = 0.27$.

Fig. 2: Time delay plot of the mean field $h_{t+1}$ versus $h_t$ for the same
parameters as Fig. 1, but this time obtained by iterating equation (1) and
using equation (3) for finite systems with decreasing size: a) N = 10^, b)
N = 10^, c) N = 10^. In each case the plots show 2000 points after initial
transients have died out.

Fig. 3: The contracted quasipotential $\phi_m(\tilde m)$ of the system (80),
(81) with linear coupling, compared to histograms of the invariant density
$w_m(\tilde m)$ obtained from numerical simulations: a) with parameters
$\kappa = 0$, $a = 0$, $\sigma = 1$, N = 10^ over 10000 time steps, b) with
parameters $\kappa = 0.4$, $a = 0.5$, $\sigma = 1$, and with N = 10^ (lower
part) and N = 10^ over 10000 time steps. In all cases the logarithmically
plotted histograms can be approximated by the contracted quasipotential (solid
line) after a vertical shift by a constant offset.

Fig. 4: Comparison of invariant density and contracted quasipotential for the
system (80), (81) with nonlinear coupling (89). The parameters are
$\kappa = 0.1$, $a = 0.5$, and $\sigma = 3.16$; in this case, the nonlinear
Frobenius-Perron dynamics predicts a period-2 attractor for the mean value.
The various parts show a) the contracted quasipotential $\phi_m(\tilde m)$,
b) a logarithmic plot of the histogram for the invariant density
$w_m(\tilde m)$ from a simulation of 10000 time steps with N = 10^ (lower
histogram) and N = 10^ (upper histogram). Parts c) and d) enlarge the
vicinity of the left periodic point and the right periodic point,
respectively, to show the agreement between the histograms and the shifted
contracted quasipotentials (solid lines).

Fig. 5: a) Time delay map, $m_{t+1}$ versus $m_t$, for the system (80), (81)
with nonlinear coupling (89) and parameters $\kappa = 0.1$, $a = 0.5$, and
$\sigma = 7.92$, obtained from a simulation with $N = 3000$ over 10000 time
steps, b) Time delay contracted quasipotential
$\phi_{m,2}(\tilde m, \tilde m')$. The contour lines for the levels of 0.002,
0.004, and 0.006 in the quasipotential follow the shape of the blurred
attractor in a).